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METHODS OFUSI NG RNASE P REACTION M ECHANISMS OF ACTION 

Field of the Invention ; 

This invention relates to newly identified polynucleotides, polypeptides encoded by 
5 certain of these polynucleotides, molecular complexes of RNAs and polypeptides, the uses 
of such polynucleotides and polypeptides, as well as the production of such polynucleotides 
and polypeptides and recombinant host cells transformed with the polynucleotides. The 
invention relates particularly to such polynucleotides and polypeptides from Staphylococci, 
especially S. aureus. This invention also relates to inhibiting the biosynthesis, assembly or 
10 action of such polynucleotides and/or polypeptides and to the use of such inhibitors in 
therapy. 

Background of the Invention; 

This invention relates to a novel bacterial ribonucleoprotein complex and the 
component parts thereof. More specifically, this invention relates to RNase P, particularly 
15 RNase P from Staphylococcus aureus, and the use of RNase P or components thereof in 
screens for the identification of antimicrobial compounds and to the use of such compounds 
in therapy. 

The Staphylococci make up a medically important genera of microbes. They are 
known to produce two types of disease, invasive and toxigenic. Invasive infections are 

20 characterized generally by abscess formation effecting both skin surfaces and deep tissues. 
S. aureus is the second leading cause of bacteremia in cancer patients. Osteomyelitis, 
septic arthritis, septic thrombophlebitis and acute bacterial endocarditis are also relatively 
common. There are at least three clinical conditions resulting from the toxigenic properties 
of Staphylococci. The manifestation of these diseases result from the actions of exotoxins 

25 as opposed to tissue invasion and bacteremia. These conditions include: Staphylococcal 
food poisoning, scalded skin syndrome and toxic shock syndrome. 

The frequency of Staphylococcus aureus infections has risen dramatically in the past 
20 years. This has been attributed to the emergence of multiply antibiotic resistant strains and 
an increasing population of people with weakened immune systems. It is no longer 

30 uncommon to isolate Staphylococcus aureus strains which are resistant to some or all of the 
standard antibiotics. This has created a demand for both new anti-microbial agents and 
diagnostic tests for this organism. 

While certain Staphylococcal proteins associated with pathogenicity have been 
identified, e.g., coagulase, hemolysins, leucocidins and exo- and enterotoxins, additional 
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targets are always useful because it is appreciated that the target of a antimicrobial screen 
can often bias the outcome. Thus, new targets allow for the discovery of new classes of 
antimicrobials. 

Brief Description of the Invention: 

5 This invention provides a novel ribonucleoprotein complex, particularly such 

complex from Staphylococcus aureus, and the separately isolated RNA and protein 
components thereof. 

In accordance with another aspect of the present invention, there are provided 
polynucleotides (DNA or RNA) which encode the protein and RNA components of such a 
10 complex. 

In particular the invention provides polynucleotides having the DNA sequences 
given herein. 

The invention also relates to novel oligonucleotides derived from the sequences 
given herein which can act, for example, as antisense inhibitors of the expression of the 

15 RNA or protein components. The oligonucleotides or fragments or derivatives thereof can 
also be used to directly inhibit catalytic activity or indirectly inhibit activity by interference 
with RNA protein complex formation. The protein and the RNA components, either 
separately or in a complex, are also useful as targets in screens designed to identify 
antimicrobial compounds. 

20 It is an object of the invention to provide polypeptides that have been identified as 

novel RNaseP RNA or holoenzymes by homology between the amino acid sequence set out in 
Table 1 [SEQ ID NO: 2] and a known amino acid sequence or sequences of other proteins, 
such as B. subtilis RNase P protein. 

It is a further object of the invention to provide polynucleotides that encode RNase P 

25 polypeptides, particularly polynucleotides that encode the polypeptide herein designated 
RNaseP, as well as polynucleotides that are transcribed into RNase P RNA, particularly 
catalytic RNA. 

In a particularly preferred embodiment of the invention the polynucleotide comprises 
a region encoding RNaseP polypeptides comprising the sequence set out in Table 1 [SEQ ID 
30 NO: 1 ] which includes a full length gene, or a variant thereof. 

In another particularly preferred embodiment of the invention there is a novel 
RNaseP protein from Staphylococcus aureus comprising the amino acid sequence of Table 1 
[SEQ ID NO:2], or a variant thereof. 
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In accordance with another aspect of the invention there is provided an isolated 
nucleic acid molecule encoding a mature polypeptide expressible by the Staphylococcus 
aureus WCUH 29 strain contained in the deposited strain. 

A further aspect of the invention there are provided isolated nucleic acid molecules 
5 encoding RNaseP, particularly Staphylococcus aureus RNaseP, including mRNAs, cDNAs, 
genomic DNAs and catalytic RNAs. Further embodiments of the invention include 
biologically, diagnostically, prophylactically, clinically or therapeutically useful variants 
thereof, and compositions comprising the same. 

In accordance with another aspect of the invention, there is provided the use of a 
10 polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunization. Among the particularly preferred embodiments of the invention are 
naturally occurring allelic variants of RNaseP and polypeptides encoded thereby. 

Another aspect of the invention there are provided novel polypeptides of 
Staphylococcus aureus referred to herein as RNaseP as well as biologically, diagnostically, 
15 prophylactically, clinically or therapeutically useful variants thereof, and compositions 
comprising the same. 

Among the particularly preferred embodiments of the invention are variants of 
RNaseP polypeptide encoded by naturally occurring alleles of the RNaseP gene. 

In a preferred embodiment of the invention there are provided methods for producing 
20 the aforementioned RNaseP polypeptides. 

In accordance with yet another aspect of the invention, there are provided inhibitors 
to such polypeptides, useful as antibacterial agents, including, for example, antibodies. 

In accordance with certain preferred embodiments of the invention, there are provided 
products, compositions and methods for assessing RNaseP expression, treating disease, for 
25 example, disease, such as, infections of the upper respiratory tract (e.g., otitis media, bacterial 
tracheitis, acute epiglottitis, thyroiditis), lower respiratory (e.g., empyema, lung abscess), 
cardiac (e.g., infective endocarditis), gastrointestinal (e.g., secretory diarrhoea, splenic 
abscess, retroperitoneal abscess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, 
conjunctivitis, keratitis, endophthalmitis, preseptal and orbital cellulitis, darcryocystitis), 
30 kidney and urinary tract (e.g., epididymitis, intrarenal and perinephric absces, toxic shock 
syndrome), skin (e.g., impetigo, folliculitis, cutaneous abscesses, cellulitis, wound infection, 
bacterial myositis) bone and joint (e.g., septic arthritis, osteomyelitis), assaying genetic 
variation, and administering a RNaseP polypeptide or polynucleotide to an organism to raise 
an immunological response against a bacteria, especially a Staphylococcus aureus bacteria. 
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In accordance with certain preferred embodiments of this and other aspects of the 
invention there are provided polynucleotides that hybridize to RNaseP polynucleotide 
sequences, particularly under stringent conditions. 

In certain preferred embodiments of the invention there are provided antibodies 
5 against RNaseP polypeptides. 

In other embodiments of the invention there are provided methods for identifying 
compounds which bind to or otherwise interact with and inhibit or activate an activity of a 
polypeptide or polynucleotide of the invention comprising: contacting a polypeptide or 
polynucleotide of the invention with a compound to be screened under conditions to permit 
10 binding to or other interaction between the compound and the polypeptide or polynucleotide 
to assess the binding to or other interaction with the compound, such binding or interaction 
being associated with a second component capable of providing a detectable signal in 
response to the binding or interaction of the polypeptide or polynucleotide with the 
compound; and determining whether the compound binds to or otherwise interacts with and 
15 activates or inhibits an activity of the polypeptide or polynucleotide by detecting the presence 
or absence of a signal generated from the binding or interaction of the compound with the 
polypeptide or polynucleotide. 

In accordance with yet another aspect of the invention, there are provided RNaseP 
agonists and antagonists, preferably bacteriostatic or bacteriocidal agonists and antagonists. 
20 In a further aspect of the invention there are provided compositions comprising a 

RNaseP polynucleotide or a RNaseP polypeptide for administration to a cell or to a 
multicellular organism. 

Various changes and modifications within the spirit and scope of the disclosed 
invention will become readily apparent to those skilled in the art from reading the following 
25 descriptions and from reading the other parts of the present disclosure. 
Brief Description of the Drawings; 

Figure 1 illustrates urea-induced denaturation of S. aureus RNaseP protein (pH 
7.0). Circles: the overall intensity of the aliphatic methyl signal in the 1D-*H NMR 
spectrum is plotted versus concentration of urea. Intensities are normalized to 100% for the 
30 denatured form (i.e. at [urea] = 6M) and 0% for the native state (at [urea] = 0). The solid 
line represents a least square fit of the data to a two-state model of protein denaturation. 
Triangles: titration of RNaseP protein at a urea concnetration of [urea] = 1M, corresponding 
to the approximate midpoint of the denaturation curve with increasing concentrations of 
sodium chloride. 
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Figure 2 illustrates NMR-Spectroscopy and sequential assignment of RNaseP 
protein from S. aureus, (a) Amide region of the 2D-[*H,'H]-NOESY spectrum, measured 
at 400MHz. (b) Sequential assignments for the amide- ^H, 15 N, 13 C a , 13 cP positions of 
residues Alal3 to He 18. For each residue given below two strips are shown, selected at the 
5 position of the amide [^H,^N]-spin system and along the 13 C-dimension of the HNCA 

spectrum (lefthand strip of each pair, 600MHz) and the CBCA(CO)NH spectrum (righthand 
strip, 400MHz). Arrows point out sequential C a \.\— »NHj connectivities in the HNCA 
spectrum. Boxes indicate sequential cPj.j and C a j_j— »NHj connectivities in the 
CBCA(CO)NH spectrum. Sample conditions: pH6.0, 500mM NaCl, 26°C. 

10 Figure 3 illustrates survey of parameters for the structure calculation of RNaseP 

protein, plotted against the sequence, (a) Number of long-range distance constraints, (b) 
Average global displacement from the mean conformation calculated for the 20 conformers 
with the lowest DYANA target function. The solid line refers to the polypeptide backbone 
atoms (N, C a , C), the dashed line to all non-hydrogen atoms, (c) Intesity of the 

15 heteronuclear J H — 15 N NOE. Horizontal bars indicate the location of a-helices (hashed 
bar) and p-sheet (filled bars). 

Figure 4 illustrates solution structure of S. aureus RNaseP protein, (a) C a -trace of 
the superposition of the 20 conformers with the lowest DYANA target function out of a 
total of 100 conformers calculated, (b) Schematic drawing of the NMR-structure of RNaseP 
20 protein. Regular secondary structures are represented as arrows (p-sheet) or ribbon (oc- 
helix). Individual side chains are shown for residues with a high level of phylogenetic 
conservation according to the sequence alignment in Figure 5. (c) Plot of the electrostatic 
potential surface. 

Figure 5 illustrates multiple sequence alignment of RNaseP protein from 1 1 
25 different procaryotic species. Numbering and location of regular secondary structures 

(hashed horizontal bars: a-helix; filled bars: (5-sheet) refer to the 5. aureus sequence, which 
is shown in the first line. Highly conserved or conservatively mutated residues are 
underlayed in black (100% conservation) or grey (>70%). Sequences are divided into two 
groups: gram-positive, similar to S.aureus (1-5) and gram-negative, similar to E.coli (6-10). 
30 Hydrophobic core residues that are located between the central P-sheet and helices 2 and 3 
are identified by * (solvent accessiblity of <10% of total surface area, averaged over the 20 
final NMR conformers). Similarly, buried residues in the interface between helix 1 and the 
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p-sheet are indicated by An open rectangle indicates the location of the RNR-box of 
highly conserved basic residues (residues 59-67). The other sequences are ordered from top 
to bottom in the order of decreasing pairwise sequence similarity: Mycoplasma capricolum 
(SwissProt P43039), Bacillus subtilis (P25814), Mycoplasma genitalium (P47703), 
5 Mycoplasma pneumoniae (P75 1 11), E. coli (P06277), Proteus mirabilis (P22835), 

Haemophilus influenzae (P44306), Buchnera aphidicola (P29433), Pseudomonas putida 
(P25752) and Coxiella burnetii (P45648). 

Figure 6 illustrates ^-chemical shift differences of the backbone amide protons as 
a consequence of ligand binding, (a) Binding of mononucleotides or mononucleosides to 

10 RNaseP protein: GMP (diamond), AMP (triangle down), 2 -deoxy-AMP (triangle up), 

equimolar mixture of CMP and UMP (square), equimolar mixture of the cytidinenucleoside 
and uridinenucleoside (circle). Titrations were performed under the following conditions: 
final ligand concentration, 40 mM; protein concentration, 800 uM; 100 mM sodium 
chloride, (b) Titration of RNaseP protein with the following homo-oligonucleotides at two 

15 different salt concentrations: Aiq (filled triangle down), (2 -deoxy-A) jo (filled triangle up. 
Sample conditions for the first two experiments were: ligand concentration 420 uM; protein 
concentration, 640mM, 300 mM sodium chloride); The following two experiments were 
recorded with 800 uM in 100 mM sodium chloride: (deoxy-A) iq (empty triangle up, 1300 
UM, 100 mM sodium chloride), Cjq (square, 1400 pM, 100 mM sodium chloride), Black 

20 horizontal bars at the bottom of the figure indicate the location of the four different protein- 
ligand interaction regions, A-D, along the sequence. All interaction studies were performed 
at pH 7.0, 23 °C. 

Figure7jllustrates substrate binding surface of RNaseP protein as defined by NMR 
spectroscopy: CP-plot of the RNaseP protein structure, superimposed with the electrostatic 

25 potential surface of the residues in the interaction site as characterized by chemical shift 
changes in Fig. 6b. Purple cylinders indicate the position of the a-helices. Approximate 
locations of the 4 major interaction regions (cmp. Figure 6b) are indicated by the letters A- 
D. The orientation of the molecule by approx. 30 degrees clockwise around the vertical axis 
versus the orientation in Fig. 4c to show the partial involvement of helix 3 in ligand 

30 interactions. 
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Glossary: 

The following definitions are provided to facilitate understanding of certain terms 
used frequently herein. Certain other definitions are provided elsewhere herein. 

"Host cell" is a cell which has been transformed or transfected, or is capable of 
5 transformation or transfection by an exogenous polynucleotide sequence. 

"Identity," as known in the art, is a relationship between two or more polypeptide 
sequences or two or more polynucleotide sequences, as determined by comparing the 
sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case may be, as determined by the match 
10 between strings of such sequences. "Identity" and "similarity" can be readily calculated by 
known methods, including but not limited to those described in (Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed.. Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., Humana 
15 Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., 
Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., 
M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. Applied 
Math., 48: 1073 (1988). Preferred methods to determine identity are designed to give the 
largest match between the sequences tested. Methods to determine identity and similarity 
20 are codified in publicly available computer programs. Preferred computer program 
methods to determine identity and similarity between two sequences include, but are not 
limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 
387 (1984)), BLASTP, BLASTN, and FASTA (Atschul, S.F. et al., J. Molec. Biol. 215: 
403-410 (1990). The BLAST X program is publicly available from NCBI and other 
25 sources (BLAST Manual, Altschul, S., et al, NCBI NLM NIH Bethesda, MD 20894; 
Altschul, S., et al, J. Mol Biol. 215: 403-410 (1990). As an illustration, by a 
polynucleotide having a nucleotide sequence having at least, for example, 95% "identity" to 
a reference nucleotide sequence of SEQ ID NO: 1 it is intended that the nucleotide 
sequence of the polynucleotide is identical to the reference sequence except that the 
30 polynucleotide sequence may include up to five point mutations per each 100 nucleotides 
of the reference nucleotide sequence of SEQ ID NO: 1. In other words, to obtain a 
polynucleotide having a nucleotide sequence at least 95% identical to a reference 
nucleotide sequence, up to 5% of the nucleotides in the reference sequence may be deleted 
or substituted with another nucleotide, or a number of nucleotides up to 5% of the total 
-7- 
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nucleotides in the reference sequence may be inserted into the reference sequence. These 
mutations of the reference sequence may occur at the 5' or 3' terminal positions of the 
reference nucleotide sequence or anywhere between those terminal positions, interspersed 
either individually among nucleotides in the reference sequence or in one or more 
5 contiguous groups within the reference sequence. Analogously, by a polypeptide having an 
amino acid sequence having at least, for example, 95% identity to a reference amino acid 
sequence of SEQ ID NO:2 is intended that the amino acid sequence of the polypeptide is 
identical to the reference sequence except that the polypeptide sequence may include up to 
five amino acid alterations per each 100 amino acids of the reference amino acid of SEQ ID 
10 NO: 2. In other words, to obtain a polypeptide having an amino acid sequence at least 95% 
identical to a reference amino acid sequence, up to 5% of the amino acid residues in the 
reference sequence may be deleted or substituted with another amino acid, or a number of 
amino acids up to 5% of the total amino acid residues in the reference sequence may be 
inserted into the reference sequence. These alterations of the reference sequence may occur 
15 at the amino or carboxy terminal positions of the reference amino acid sequence or 
anywhere between those terminal positions, interspersed either individually among residues 
in the reference sequence or in one or more contiguous groups within the reference 
sequence. 

"Isolated" means altered "by the hand of man" from its natural state, i.e., if it occurs 
20 in nature, it has been changed or removed from its original environment, or both. For 
example, a polynucleotide or a polypeptide naturally present in a living organism is not 
"isolated," but the same polynucleotide or polypeptide separated from the coexisting materials 
of its natural state is "isolated", as the term is employed herein. 

"Polynucleotide(s)" generally refers to any polyribonucleotide or 
25 polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. 
"Polynucleotide(s)" include, without limitation, single- and double-stranded DNA, DNA that 
is a mixture of single- and double-stranded regions or single-, double- and triple-stranded 
regions, single- and double-stranded RNA, and RNA that is mixture of single- and double- 
stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, 
30 more typically, double-stranded, or triple-stranded regions, or a mixture of single- and double- 
stranded regions. In addition, "polynucleotide" as used herein refers to triple-stranded regions 
comprising RNA or DNA or both RNA and DNA. The strands in such regions may be from 
the same molecule or from different molecules. The regions may include all of one or more 
of the molecules, but more typically involve only a region of some of the molecules. One of 
-8- 
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the molecules of a triple-helical region often is an oligonucleotide. As used herein, the term 
"polynucleotide(s)" also includes DNAs or RNAs as described above that contain one or more 
modified bases. Thus, DNAs or RNAs with backbones modified for stability or for other 
reasons are "polynucleotide(s)" as that term is intended herein. Moreover, DNAs or RNAs 
5 comprising unusual bases, such as inosine, or modified bases, such as tritylated bases, to name 
just two examples, are polynucleotides as the term is used herein. It will be appreciated that a 
great variety of modifications have been made to DNA and RNA that serve many useful 
purposes known to those of skill in the art. The term "polynucleotide(s)" as it is employed 
herein embraces such chemically, enzymatically or metabolically modified forms of 
10 polynucleotides, as well as the chemical forms of DNA and RNA characteristic of viruses and 
cells, including, for example, simple and complex cells. "Polynucleotide(s)" also embraces 
short polynucleotides often referred to as oligonucleotide(s). 

"Polypeptide(s)" refers to any peptide or protein comprising two or more amino acids 
joined to each other by peptide bonds or modified peptide bonds. "Polypeptide(s)" refers to 
15 both short chains, commonly referred to as peptides, oligopeptides and oligomers and to 
longer chains generally referred to as proteins. Polypeptides may contain amino acids other 
than the 20 gene encoded amino acids. "Polypeptide(s)" include those modified either by 
natural processes, such as processing and other post-translational modifications, but also by 
chemical modification techniques. Such modifications are well described in basic texts and in 
20 more detailed monographs, as well as in a voluminous research literature, and they are well 
known to those of skill in the art. It will be appreciated that the same type of modification 
may be present in the same or varying degree at several sites in a given polypeptide. Also, a 
given polypeptide may contain many types of modifications. Modifications can occur 
anywhere in a polypeptide, including the peptide backbone, the amino acid side-chains, and 
25 the amino or carboxyl termini. Modifications include, for example, acetylation, acylation, 
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme 
moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a 
lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, 
cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, 
30 formation of cysteine, formation of pyroglutamate, formylation, gamma-carboxylation, 
glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, myristoylation, 
oxidation, proteolytic processing, phosphorylation, prenylation, racemization, glycosylation, 
lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation 
and ADP-ribosylation, selenoylation, sulfation, transfer-RNA mediated addition of amino 
-9- 
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acids to proteins, such as arginylation. and ubiquitination. See, for instance, PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman 
and Company, New York (1993) and Wold, F., Posttranslational Protein Modifications: 
Perspectives and Prospects, pgs. 1-12 in POSTTRANSLATIONAL COVALENT 
5 MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic Press, New York (1983); 
Seifter et al., Meth. Enzymol. 752:626-646 (1990) and Rattan et al., Protein Synthesis: 
Posttranslational Modifications and Aging, Ann. N.Y. Acad. Sci. 663: 48-62 (1992). 
Polypeptides may be branched or cyclic, with or without branching. Cyclic, branched and 
branched circular polypeptides may result from post-translational natural processes and may 

10 be made by entirely synthetic methods, as well. 

"Variant(s)" as the term is used herein, is a polynucleotide or polypeptide that 
differs from a reference polynucleotide or polypeptide respectively, but retains essential 
properties. A typical variant of a polynucleotide differs in nucleotide sequence from 
another, reference polynucleotide. Changes in the nucleotide sequence of the variant may 

15 or may not alter the amino acid sequence of a polypeptide encoded by the reference 
polynucleotide. Nucleotide changes may result in amino acid substitutions, additions, 
deletions, fusions and truncations in the polypeptide encoded by the reference sequence, as 
discussed below. A typical variant of a polypeptide differs in amino acid sequence from 
another, reference polypeptide. Generally, differences are limited so that the sequences of 

20 the reference polypeptide and the variant are closely similar overall and, in many regions, 
identical. A variant and reference polypeptide may differ in amino acid sequence by one or 
more substitutions, additions, deletions in any combination. A substituted or inserted 
amino acid residue may or may not be one encoded by the genetic code. A variant of a 
polynucleotide or polypeptide may be a naturally occurring such as an allelic variant, or it 

25 may be a variant that is not known to occur naturally. Non-naturally occurring variants of 
polynucleotides and polypeptides may be made by mutagenesis techniques, by direct 
synthesis, and by other recombinant methods known to skilled artisans. 

Detailed Description of the Invention: 

The ribonucleoprotein, RNase P, plays a key role in the biosynthesis of transfer 
30 RNA (tRNA), itself a key intermediate in protein biosynthesis. RNase P functions to 
process precursor RNAs into mature tRNAs by endoribonucleolytic action. The complex in 
prokaryotes is composed of two subunits: a catalytic RNA and protein co-factor. Recent 
reviews of certain RNase P molecules exist. See for example, L.A. Kirsebom, Molecular 
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Microbiology 17(3). 41 1-420 (1995) or N. R. Pace and J. W. Brown, J. Bacterid . 177 (8). 
1919-1928(1995). 

The invention relates to novel RNaseP polypeptides and polynucleotides as described 
in greater detail below. In particular, the invention relates to polypeptides and 
5 polynucleotides of a novel RNaseP of Staphylococcus aureus, which is related by amino acid 
sequence homology to RNase P polypeptide set forth in SEQ ID NO:2. The invention relates 
especially to RNaseP having the nucleotide and amino acid sequences set out in Table 1, SEQ 
ID NO: 1 and SEQ ID NO: 2 respectively, and to the RNaseP nucleotide sequences of the 
DNA in the deposited strain and amino acid sequences encoded thereby. The invention also 

10 relates to the RNase P RNA component, particularly in its catalytic for, and sequences from 
which such component is transcribed. 
RNase P RNA Component: 

Phylogenetic comparisons readily allow secondary structure modeling and the 
identification of a minimum consensus structure. Data concerning the RNase P RNA 

15 structure are available in the RNase P database on the 
w.w.w.(http://jwbrown.mbio.ncsu.edu/RNaseP/home.html). A polynucleotide of the 
invention from which the RNA component is transcribed is set forth in [SEQ ID NO: 14]. 

In general, few nucleotides are conserved but compensatory base changes in 
hydrogen bonded regions show that the overall structure is preserved throughout eubacteria. 

20 Universal conservation of primary sequences (E. coli : 61-74, 353-360) together with other 
conserved or quasi-conserved nucleotides implicate functional importance, the significance 
of which remain unknown. To date all RNase P RNA molecules can be folded to fit a 
consensus 'cage-like' structure, beyond this domain there is no convincing structural 
similarity between prokaryotic and eukaryotic RNase P RNAs. 

25 RNase P Protein Component: 

The precise functional role of the protein remains unknown. While it is appreciated that in 
vitro the experimental conditions can be established such that the protein component is not 
necessary for catalytic activity, in vivo, the protein components appears to be required. A 
novel RNase P protein component from S. aureus has been identified and is characterized 

30 by the amino acid sequence given in [SEQ ID NO:2] in which the S. aureus (Sau) sequence 
is aligned with sequences from other microorganisms. 

The full length sequence encoding the intact RNase P protein component can be 
obtained by probing a genomic library by for example in situ colony hybridization (infra) 
using a probe(s) generated based on the sequences given in [SEQ ID NOS:l and 2]. 
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TABLE 1 

RNaseP Polynucleotide and Polypeptide Sequences 

5 (A) Sequences from Staphylococcus aureus RNaseP polynucleotide sequence [SEQ ID 
NO:lJ. 

5' -ATG TTA TTG GAA AAA GCT TAC CGA ATT AAA AAG AAT GCA GAT TTT CAG 
AGA ATA TAT AAA AAA GGT CAT TCT GTA GCC AAC AGA CAA TTT GTT GTA TAC 
ACT TGT AAT AAT AAA GAA ATA GAC CAT TTT CGC TTA GGT ATT AGT GTT TCT 
10 AAA AAA CTA GGT AAT GCA GTG TTA AGA AAC AAG ATT AAA AGA GCA ATA CGT 
GAA AAT TTC AAA GTA CAT AAG TCG CAT ATA TTG GCC AAA GAT ATT ATT GTA 
ATA GCA AGA CAG CCA GCT AAA GAT ATG ACG ACT TTA CAA ATA CAG AAT AGT 
CTT GAG CAC GTA CTT AAA ATT GCC AAA GTT TTT AAT AAA AAG ATT AAG TAA- 
3' 

15 

(B) RNaseP polypeptide sequence deduced from the polynucleotide sequence in this 
table [SEQIDNO:2]. 

NH 2 - MLLEK VYRIK KNADF GRIYK KGHSV ANRQF WYTC NNKEI DHFRL GISVS 
KKLGN AVLRN KIKRA IRENF KVHKS HILAR DIIVI ARQPA KDMTT LQIQN SLEHV 
20 LKIAK VFNKK IK-COOH 



(C) Polynucleotide sequence embodiments [SEQ ID NO: 1]. 












x -< R l>n -ATG TTA TTG GAA GCT TAC CGA ATT AAA 


AAG 


AAT 


GCA 


GAT 


TTT 


CAG AGA ATA TAT AAA AAA GGT CAT TCT GTA GCC AAC 


AGA 


CAA 


TTT 


GTT 


GTA 


25 TAC ACT TGT AAT AAT AAA GAA ATA GAC CAT TTT CGC 


TTA 


GGT 


ATT 


AGT 


GTT 


TCT AAA AAA CTA GGT AAT GCA GTG TTA AGA AAC AAG 


ATT 


AAA 


AGA 


GCA 


ATA 


CGT GAA AAT TTC AAA GTA CAT AAG TCG CAT ATA TTG 


GCC 


AAA 


GAT 


ATT 


ATT 


GTA ATA GCA AGA CAG CCA GCT AAA GAT ATG ACG ACT 


TTA 


CAA 


ATA 


CAG 


AAT 


AGT CTT GAG CAC GTA CTT AAA ATT GCC AAA GTT TTT 


AAT 


AAA 


AAG 


ATT 


AAG 


30 TAA-(R 2 ) n -Y 













(D) Polypeptide sequence embodiments [SEQ ID NO:2]. 

X _(R 1 ) n -MLLEK VYRIK KNADF GRIYK KGHSV ANRQF WYTC NNKEI DHFRL GISVS 
KKLGN AVLRN KIKRA IRENF KVHKS HILAR DIIVI ARQPA KDMTT LQIQN SLEHV 
35 LKIAK VFNKK IK-(R 2 ) n - Y 

(E) Sequences from Staphylococcus aureus RNaseP RNA gene [SEQ ID NO:3]. 
5'- 

GTTCTGATATTTTGGGTAATCGCTATATTATATAGAGGAAAGTCCATGCTCACACAGTCTGAGATGAT 
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GTAGTGTTCGTGCTTGATGAAACAATAAATCAAGGCATTAATTTGACGGCAATGAAATATCCTAAGTT 
TTCGATATGGATAGAGTAATTTGAAAGTGCCACAGTGACGTAGCTTTTATAGAAATATAAAAGGTGGA 
CGCGGTAAACCCCTCGAGTGAGCAATCCAAATTTGGTAGGAGCACTTGTTTAACGGAATTCAACGTAT 
AAACGAGACACACTTCGCGAAATGAAGTGGTGTACGACAGATGGTTATCACCTGAGTACCAGTGTGA 
5 CTAGTGCACGTGATGAGTACGATGGAACAGAACATGGCTTATAGAAATATCACTACTA 
G-3' 

(F) Polynucleotide sequence embodiments [SEQ ID NO:3]. 

x-(Ri> n - 

10 GTTCTGATATTTTGGGTAATCGCTATATTATATAGAGGAAAGTCCATGCTCACACAGTCTGAGATGAT 
GTAGTGTTCGTGCTTGATGAAACAATAAATCAAGGCATTAATTTGACGGCAATGAAATATCCTAAGTT 
TTCGATATGGATAGAGTAATTTGAAAGTGCCACAGTGACGTAGCTTTTATAGAAATATAAAAGGTGGA 
CGCGGTAAACCCCTCGAGTGAGCAATCCAAATTTGGTAGGAGCACTTGTTTAACGGAATTCAACGTAT 
AAACGAGACACACTTCGCGAAATGAAGTGGTGTACGACAGATGGTTATCACCTGAGTACCAGTGTGA 

1 5 CTAGTGCACGTGATGAGTACGATGGAACAGAACATGGCTTATAGAAATATCACTACTA 
G-(R 2 ) n -Y 

Polypeptides of the invention: 

The polypeptides of the invention include the polypeptide of Table 1 [SEQ ID NO:2] 
(in particular the mature polypeptide) as well as polypeptides and fragments, particularly 
20 those which have the biological activity of RNaseP, and also those which have at least 70% 
identity to the polypeptide of Table 1 [SEQ ID NO:2] or the relevant portion, preferably at 
least 80% identity to the polypeptide of Table 1 [SEQ ID NO:2], and more preferably at least 
90% similarity (more preferably at least 90% identity) to the polypeptide of Table 1 [SEQ ID 
NO:2] and still more preferably at least 95% similarity (still more preferably at least 95% 
25 identity) to the polypeptide of Table 1 [SEQ ID NO:2] and also include portions of such 
polypeptides with such portion of the polypeptide generally containing at least 30 amino acids 
and more preferably at least 50 amino acids. 

The invention also includes polypeptides of the formula set forth in Table 1 (D) [SEQ 
ID NO:2] wherein, at the amino terminus, X is hydrogen, and at the carboxyl terminus, Y is 
30 hydrogen or a metal, Rj and R 2 is any amino acid residue, and n is an integer between 1 and 
1000. Any stretch of amino acid residues denoted by either R group, where R is greater than 
1, may be either a heteropolymer or a homopolymer, preferably a heteropolymer. 

A fragment is a variant polypeptide having an amino acid sequence that entirely is the 
same as part but not all of the amino acid sequence of the aforementioned polypeptides. As 
35 with RNaseP polypeptides fragments may be "free-standing," or comprised within a larger 
polypeptide of which they form a part or region, most preferably as a single continuous 
region, a single larger polypeptide. 
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Preferred fragments include, for example, truncation polypeptides having a portion of 
the amino acid sequence of Table 1 [SEQ ID NO:2], or of variants thereof, such as a 
continuous series of residues that includes the amino terminus, or a continuous series of 
residues that includes the carboxyl terminus. Degradation forms of the polypeptides of the 
5 invention in a host cell, particularly a Staphylococcus aureus, are also preferred. Further 
preferred are fragments characterized by structural or functional attributes such as fragments 
that comprise alpha-helix and alpha-helix forming regions, beta-sheet and beta-sheet-forming 
regions, turn and turn-forming regions, coil and coil-forming regions, hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, 

10 surface-forming regions, substrate binding region, and high antigenic index regions. 

Also preferred are biologically active fragments which are those fragments that 
mediate activities of RNaseP, including those with a similar activity or an improved activity, 
or with a decreased undesirable activity. Also included are those fragments that are antigenic 
or immunogenic in an animal, especially in a human. Particularly preferred are fragments 

15 comprising receptors or domains of enzymes that confer a function essential for viability of 
Staphylococcus aureus or the ability to initiate, or maintain cause disease in an individual, 
particularly a human. 

Variants that are fragments of the polypeptides of the invention may be employed for 
producing the corresponding full-length polypeptide by peptide synthesis; therefore, these 
20 variants may be employed as intermediates for producing the full-length polypeptides of the 
invention. 

Polynucleotides of the invention: 

Another aspect of the invention relates to isolated polynucleotides, including the full 
length gene, that encode the RNaseP polypeptide having a deduced amino acid sequence of 
25 Table 1 [SEQ ID NO:2] and polynucleotides closely related thereto and variants thereof. 

Using the information provided herein, such as a polynucleotide sequence set out in 
SEQ ID NOS: 1, 3, 4 and 14, a polynucleotide of the invention encoding RNaseP polypeptide 
or RNA (such as that transcribed from SEQ ID NO:3) may be obtained using standard cloning 
and screening methods, such as those for cloning and sequencing chromosomal DNA 
30 fragments from bacteria using Staphylococcus aureus WCUH 29 cells as starting material, 
followed by obtaining a full length clone. For example, to obtain a polynucleotide sequence 
of the invention, such as a sequence given in SEQ ID NOS .l , 3, 4 and 14, typically a library 
of clones of chromosomal DNA of Staphylococcus aureus WCUH 29 in E.coli or some 
other suitable host is probed with a radiolabeled oligonucleotide, preferably a 17-mer or 
- 14- 
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longer, derived from a partial sequence. Clones carrying DNA identical to that of the probe 
can then be distinguished using stringent conditions. By sequencing the individual clones 
thus identified with sequencing primers designed from the original sequence it is then 
possible to extend the sequence in both directions to determine the full gene sequence. 

5 Conveniently, such sequencing is performed using denatured double stranded DNA 
prepared from a plasmid clone. Suitable techniques are described by Maniatis, T., Fritsch, 
E.F. and Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed.; 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989). (see in 
particular Screening By Hybridization 1.90 and Sequencing Denatured Double-Stranded 

10 DNA Templates 13.70). Illustrative of the invention, the polynucleotides set out in Table 1 
[SEQ ID NO:l, 4 and 5] were discovered in a DNA library derived from Staphylococcus 
aureus WC\M29. 

Certain DNA sequences set out in Table I [SEQ ID NO:l] contains an open reading 
frame encoding a protein having about the number of amino acid residues set forth in Table 1 
15 [SEQ ID NO:2] with a deduced molecular weight that can be calculated using amino acid 
residue molecular weight values well known in the art. The polynucleotide of SEQ ID NO: 1, 
between nucleotide number 1 through number 351 encodes the polypeptide of SEQ ID NO:2. 
The stop codon begins at nucleotide number 352 of SEQ ID NO: 1 . 

RNaseP of the invention is structurally related to other proteins of the RNase P 
20 family, as shown by the results of sequencing the DNA encoding RNaseP of the deposited 
strain. The protein exhibits greatest homology to B. subtilis protein among known proteins. 
RNaseP of Table 1 [SEQ ID NO:2] has significant identity and similarity over its entire length 
with the amino acid sequence of B. subtilis RNase P polypeptide. 

The invention provides a polynucleotide sequence identical over its entire length to 
25 the coding sequence in Table 1 [SEQ ID NO: 1]. Also provided by the invention is the coding 
sequence for the mature polypeptide or a fragment thereof, by itself as well as the coding 
sequence for the mature polypeptide or a fragment in reading frame with other coding 
sequence, such as those encoding a leader or secretory sequence, a pre-, or pro- or prepro- 
protein sequence. The polynucleotide may also contain non-coding sequences, including for 
30 example, but not limited to non-coding 5' and 3' sequences, such as the transcribed, non- 
translated sequences, termination signals, ribosome binding sites, sequences that stabilize 
mRNA, introns, polyadenylation signals, and additional coding sequence which encode 
additional amino acids. For example, a marker sequence that facilitates purification of the 
fused polypeptide can be encoded. In certain embodiments of the invention, the marker 
- 15 - 
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sequence is a hexa-histidine peptide, as provided in the pQE vector (Qiagen. Inc.) and 
described in Gentz et ai, Proc. Natl. Acad. ScL, USA 86: 821-824 (1989), or an HA tag 
(Wilson et ah, Cell 37: 767 (1984). Polynucleotides of the invention also include, but are not 
limited to, polynucleotides comprising a structural gene and its naturally associated sequences 
5 that control gene expression. 

A preferred embodiment of the invention is a polynucleotide of comprising 
nucleotide 1 to 351 or 354 set forth in SEQ ID NO:l of Table 1 which encode the RNaseP 
polypeptide. 

The invention also includes polynucleotides of the formula set forth in Table 1 

10 (C)[SEQ ID NO:l and (F)[SEQ ID NO:3] wherein, at the 5' end of the molecule, X is 
hydrogen, and at the 3' end of the molecule, Y is hydrogen or a metal, Rj and R 2 is any 
nucleic acid residue, and n is an integer between 1 and 3000. Any stretch of nucleic acid 
residues denoted by either R group, where R is greater than 1, may be either a heteropolymer 
or a homopolymer, preferably a heteropolymer. A preferred embodiment for the sequence set 

15 forth in Table 1 (F) [SEQ ID NO:3] has Ri or R 2 being between 1 and 10 or 1 and 20, and 
especially being 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10. The invention also provides RNA transcribed 
from such polynucleotides, particularly catalytic RNAs. 

The term "polynucleotide encoding a polypeptide" as used herein encompasses 
polynucleotides that include a sequence encoding a polypeptide of the invention, particularly a 

20 bacterial polypeptide and more particularly a polypeptide of the Staphylococcus aureus 
RNaseP having the amino acid sequence set out in Table 1 [SEQ ID NO:2]. The term also 
encompasses polynucleotides that include a single continuous region or discontinuous regions 
encoding the polypeptide (for example, interrupted by integrated phage or an insertion 
sequence or editing) together with additional regions, that also may contain coding and/or 

25 non-coding sequences. 

The invention further relates to variants of the polynucleotides described herein that 
encode for variants of the polypeptide having the deduced amino acid sequence of Table 1 
[SEQ ID NO:2]. Variants that are fragments of the polynucleotides of the invention may be 
used to synthesize full-length polynucleotides of the invention. 
30 In addition to the standard A, G, C, TAJ representations for nucleic acid bases, the 

term "N" is also used. "N" means that any of the four DNA or RNA bases may appear at 
such a designated position in the DNA or RNA sequence, except that, in preferred 
embodiments, N can not be a base that when taken in combination with adjacent nucleotide 
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positions, when read in the correct reading frame, would have the effect of generating a 
premature termination codon in such reading frame. 

Further particularly preferred embodiments are polynucleotides encoding RNaseP 
variants, that have the amino acid sequence of RNaseP polypeptide of Table 1 [SEQ ID NO:2] 
5 in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are substituted, 
deleted or added, in any combination. Especially preferred among these are silent 
substitutions, additions and deletions, that do not alter the properties and activities of RNaseP. 

Further preferred embodiments of the invention are polynucleotides that are at least 
50%, 60% or 70% identical over their entire length to a polynucleotide encoding RNaseP 

10 polypeptide having an amino acid sequence set out in Table 1 [SEQ ID NO:2], and 
polynucleotides that are complementary to such polynucleotides. Alternatively, most highly 
preferred are polynucleotides that comprise a region that is at least 80% identical over its 
entire length to a polynucleotide encoding RNaseP polypeptide of the deposited strain and 
polynucleotides complementary thereto. In this regard, polynucleotides at least 90% identical 

15 over their entire length to the same are particularly preferred, and among these particularly 
preferred polynucleotides, those with at least 95% are especially preferred. Furthermore, 
those with at least 97% are highly preferred among those with at least 95%, and among these 
those with at least 98% and at least 99% are particularly highly preferred, with at least 99% 
being the more preferred. 

20 Preferred embodiments of the invention are polynucleotides that are at least 50%, 

60% or 70% identical over their entire length to an RNaseP polynucleotide having a 
nucleotide sequence set out in SEQ ID NO:3, 4 or 14, and polynucleotides that are 
complementary to such polynucleotides. Alternatively, most highly preferred are 
polynucleotides that comprise a region that is at least 80% identical over its entire length to an 

25 RNaseP polynucleotide of the deposited strain and polynucleotides complementary thereto. 
In this regard, polynucleotides at least 90% identical over their entire length to the same are 
particularly preferred, and among these particularly preferred polynucleotides, those with at 
least 95% are especially preferred. Furthermore, those with at least 97% are highly preferred 
among those with at least 95%, and among these those with at least 987c and at least 99% are 

30 particularly highly preferred, with at least 99% being the more preferred. It is especially 
preferred that these polynucleotides be RNAs, especially catalytic RNAs. 

Preferred embodiments are polynucleotides that encode polypeptides that retain 
substantially the same biological function or activity as the mature polypeptide encoded by 
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the DNA of Table 1 [SEQ ID NO: 1 ] or as the RNase P RNA component transcribed by the 
DNA of SEQ ID NO: 3, 4 or 14. 

The invention further relates to polynucleotides that hybridize to the herein above- 
described sequences. In this regard, the invention especially relates to polynucleotides that 

5 hybridize under stringent conditions to the herein above-described polynucleotides. As herein 
used, the terms "stringent conditions" and "stringent hybridization conditions" mean 
hybridization will occur only if there is at least 95% and preferably at least 97% identity 
between the sequences. An example of stringent hybridization conditions is overnight 
incubation at 42°C in a solution comprising: 50% formamide, 5x SSC (150mM NaCl, 

10 15mM trisodium citrate), 50 raM sodium phosphate (pH7.6), 5x Denhardt's solution, 10% 
dextran sulfate, and 20 micrograms/ml denatured, sheared salmon sperm DNA, followed by 
washing the hybridization support in 0.1 x SSC at about 65°C. Hybridization and wash 
conditions are well known and exemplified in Sambrook, et al.. Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989), particularly 

15 Chapter 1 1 therein. 

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence set forth in SEQ ID NO: 1 or SEQ ID NO:3 or 
SEQ ID NO:4 under stringent hybridization conditions with a probe having the sequence of 

20 said polynucleotide sequence set forth in SEQ ID NO: 1 or SEQ ID NO:3 or SEQ ID NO:4 
respectively or a fragment thereof; and isolating said DNA sequence. Fragments useful for 
obtaining such a polynucleotide include, for example, probes and primers described 
elsewhere herein. 

As discussed herein regarding polynucleotide assays of the invention, for instance, 
25 polynucleotides of the invention as discussed above, may be used as a hybridization probe for 
RNA, cDNA and genomic DNA to isolate full-length cDNAs and genomic clones encoding 
RNaseP and to isolate cDNA and genomic clones of other genes that have a high sequence 
similarity to the RNaseP gene. Such probes generally will comprise at least 15 bases. 
Preferably, such probes will have at least 30 bases and may have at least 50 bases. 
30 Particularly preferred probes will have at least 30 bases and will have 50 bases or less. 

Polynucleotides of the invention that are oligonucleotides derived from the 
sequences of SEQ ID NOS:l and/or 2 and/or 3 and/or 4 and/or 14 may be used in the 
processes herein as described, but preferably for PCR, to determine whether or not the 
polynucleotides identified herein in whole or in part are transcribed in bacteria in infected 
- 18- 
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tissue. It is recognized that such sequences will also have utility in diagnosis of the stage of 
infection and type of infection the pathogen has attained. 

The invention also provides polynucleotides that may encode a polypeptide that is the 
mature protein plus additional amino or carboxyl-terminal amino acids, or amino acids 
5 interior to the mature polypeptide (when the mature form has more than one polypeptide 
chain, for instance). Such sequences may play a role in processing of a protein from precursor 
to a mature form, may allow protein transport, may lengthen or shorten protein half-life or 
may facilitate manipulation of a protein for assay or production, among other things. As 
generally is the case in vivo, the additional amino acids may be processed away from the 
10 mature protein by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. When prosequences are removed 
such inactive precursors generally are activated. Some or all of the prosequences may be 
removed before activation. Generally, such precursors are called proproteins. 
15 In sum, a polynucleotide of the invention may encode a mature protein, a mature 

protein plus a leader sequence (which may be referred to as a preprotein), a precursor of a 
mature protein having one or more prosequences that are not the leader sequences of a 
preprotein, or a preproprotein, which is a precursor to a proprotein, having a leader sequence 
and one or more prosequences, which generally are removed during processing steps that 
20 produce active and mature forms of the polypeptide. 
Cloning of 5~. aureus RNase P RNA structural gene: 

A partial homolog to B. subdlis RNase P RNA was identified in a proprietary S. 
aureus database comprising sequences from randomly sequenced S. aureus DNA library in 
an E. coli host. 

25 A PCR primer based on these data was designed to the 3' end of the gene (primer: 

5'-cgc GAA gtg tgt ctc gtt tat acg-3 ' ) [SEQ ID NO:5] and a second based on 
a universally conserved sequence within the 5' domain (5 '-gag gaa agt CCA TGC TC- 
3 ' } [SEQ ID NO:6] ( available from the RNase P database w.w.w.) permitted recovery of 
approximately 90% of the gene. The complete structural gene was amplified using a 

30 degenerate primer (5 ' - c / t GATATTTCV t gV g taa t / c C-3 ' ) [SEQ ID NO:13] that would 
allow the RNA product to form the predicted helices PI and P2. The structural gene has 
been cloned behind a T7 promoter, sequenced and shown to be highly related to B. subtilis 
homolog. The precise S. aureus genomic sequence encoding (transcribing) helices PI and 
P2 may be determined by skilled artisans using methods and compounds of the invention, 
- 19- 
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such as the degenerate primer described above [SEQ ID NO: 13]. The exact 5' end has 
been determined by primer extension analysis using endogenous S. aureus RNA isolated 
from 5. aureus WCUH29. The sequence of the first 20 nucleotides of RNaseP RNA 5' end 
as determined by this primer extension analysis is: 
5 5 ' -GUUCUGAUAUUUUGGGUAAU-3 ' [SEQ ID NO: 14]. 

Further Description and Definitions: 

The coding region of the RNaseP gene may be isolated, for example, by screening 

using a deposit containing a Staphylococcus aureus WCUH 29 strain which has been 

deposited with the National Collections of Industrial and Marine Bacteria Ltd. (herein 
10 "NCIMB"), 23 St. Machar Drive, Aberdeen AB2 1RY, Scotland on 1 1 September 1995 and 

assigned NCIMB Deposit No. 40771 . It was referred to as Staphylococcus aureus WCUH29 

on deposit. The Staphylococcus aureus strain deposit is referred to herein as "the deposited 

strain" or as "the DNA of the deposited strain." 

The deposited strain contains the full length RNaseP gene. The sequence of the 
15 polynucleotides contained in the deposited strain, as well as the amino acid sequence of the 

polypeptide encoded thereby, are controlling in the event of any conflict with any description 

of sequences herein. 

The deposit of the deposited strain has been made under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Micro-organisms for Purposes of 

20 Patent Procedure. The strain will be irrevocably and without restriction or condition released 
to the public upon the issuance of a patent. The deposited strain is provided merely as 
convenience to those of skill in the art and is not an admission that a deposit is required for 
enablement, such as that required under 35 U.S.C. §112. 

A license may be required to make, use or sell the deposited strain, and compounds 

25 derived therefrom, and no such license is hereby granted. 

The nucleotide sequences disclosed herein can also be obtained by synthetic 
chemical techniques known in the art or can be obtained from S. aureus WCUH 29 by 
probing a DNA preparation with probes constructed from the particular sequences disclosed 
herein. Alternatively, oligonucleotides derived from a disclosed sequence can act as PCR 

30 primers in a process of PCR-based cloning of the sequence from a bacterial genomic 
source. It is recognized that such sequences will also have utility in diagnosis of the type of 
infection the pathogen has attained. 
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A polynucleotide of the present invention may be in the form of RNA or in the 
form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA. The DNA 
may be double-stranded or single-stranded, and if single stranded may be the coding strand 
or non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide 
5 may be identical to the coding sequence shown or may be a different coding sequence 
which coding sequence, as a result of the redundancy or degeneracy of the genetic code, 
encoding the same polypeptide. 

Thus, the term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well as a 
10 polynucleotide which includes additional coding and/or non-coding sequence. 

The present invention therefore includes polynucleotides, wherein the coding 
sequence for the mature polypeptide may be fused in the same reading frame to a 
polynucleotide sequence which aids in expression and secretion of a polypeptide from a 
host cell, for example, a leader sequence which functions as a secretory sequence for 
15 controlling transport of a polypeptide from the cell. The polypeptide having a leader 
sequence is a preprotein and may have the leader sequence cleaved by the host cell to form 
the mature form of the polypeptide. The polynucleotides may also encode for a proprotein 
which is the mature protein plus additional 5' amino acid residues. A mature protein having 
a prosequence is a proprotein and is an inactive form of the protein. Once the prosequence 
20 is cleaved an active mature protein remains. 

Thus, for example, the polynucleotide of the present invention may encode for a 
mature protein, or for a protein having a prosequence or for a protein having both a 
prosequence and a presequence (leader sequence). Further, the amino acid sequences 
provided herein show a methionine residue at the NH^-terminus. It is appreciated, 
25 however, that during post-translational modification of the peptide, this residue may be 
deleted. Accordingly, this invention contemplates the use of both the methionine- 
containing and the methionineless amino terminal variants of each protein disclosed herein. 

The polynucleotides of the present invention may also have the coding sequence 
fused in frame to a marker sequence at either the 5' or 3' terminus of the gene which allows 
30 for purification of the polypeptide of the present invention. The marker sequence may be a 
hexa-histidine tag supplied by the pQE series of vectors (supplied commercially by 
Quiagen Inc.) to provide for purification of the polypeptide fused to the marker in the case 
of a bacterial host. Alternatively the maltose binding protein (MBP) fusion system may be 
employed. In this system the gene of interest is fused the malE gene encoding the MBP 
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(supplied by New England BioLabs). The fusion product is purified in a one step procedure 
based on the MBP affinity for maltose. A pre-engineered Xa cleavage site allows for 
efficient removal of the MBP component from the gene product of interest. 

In order to facilitate understanding of the following example certain frequently 
5 occurring methods and/or terms will be described. 

"Plasmids" are designated by a lower case p preceded and/or followed by capital 
letters and/or numbers. The starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accord with published procedures. In addition, equivalent plasmids to those described are 
10 known in the art and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 
enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 
used herein are commercially available and their reaction conditions, cofactors and other 
requirements were used as would be known to the ordinarily skilled artisan. For analytical 
15 purposes, typically 1 ug of plasmid or DNA fragment is used with about 2 units of enzyme 
in about 20 pi of buffer solution. For the purpose of isolating DNA fragments for plasmid 
construction, typically 5 to 50 ug of DNA are digested with 20 to 250 units of enzyme in a 
larger volume. Appropriate buffers and substrate amounts for particular restriction 
enzymes are specified by the manufacturer. Incubation times of about 1 hour at 37°C are 
20 ordinarily used, but may vary in accordance with the supplier's instructions. After digestion 
the reaction is electrophoresed directly on an agarose gel to isolate the desired fragment. 
Size separation of the cleaved fragments is generally performed using a 1 % percent agarose 
gel. 

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 
25 complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
synthetic oligonucleotides have no 5' phosphate and thus will not ligate to another 
oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated. 

"Ligation" refers to the process of forming phosphodiester bonds between two 
30 double stranded nucleic acid fragments (Maniatis, T., et al, supra., p. 146). Unless 
otherwise provided, ligation may be accomplished using known buffers and conditions with 
10 units to T4 DNA ligase ("ligase") per 0.5 ug of approximately equimolar amounts of the 
DNA fragments to be ligated. 
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The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that 
functions as an autonomous unit of DNA replication in vivo; i.e., capable of replication 
5 under its own control. 

A "vector" is a replicon, such as a plasmid, phage, or cosmid, to which another 
DNA segment may be attached so as to bring about the replication of the attached segment. 

A "double-stranded DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded 
10 helix, both relaxed and supercoiled. This term refers only to the primary and secondary 
structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this 
term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g., 
restriction fragments), viruses, plasmids, and chromosomes. In discussing the structure of 
particular double-stranded DNA molecules, sequences may be described herein according 
15 to the normal convention of giving only the sequence in the 5' to 3' direction along the 
nontranscribed strand of DNA (i.e., the strand having the sequence homologous to the 
mRNA). 

A DNA "coding sequence of or a "nucleotide sequence encoding" a particular 
protein, is a DNA sequence which is transcribed and translated into a polypeptide when 
20 placed under the control of appropriate regulatory sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3' direction) coding 
sequence. For purposes of defining the present invention, the promoter sequence is bound 
at the 3' terminus by a translation start codon (e.g., ATG) of a coding sequence and extends 
25 upstream (5' direction) to include the minimum number of bases or elements necessary to 
initiate transcription at levels detectable above background. Within the promoter sequence 
will be found a transcription initiation site (conveniently defined by mapping with nuclease 
SI), as well as protein binding domains (consensus sequences) responsible for the binding 
of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" 
30 boxes and "CAT" boxes. Prokaryotic promoters contain the -10 and -35 consensus 
sequences. 

DNA "control sequences" refers collectively to promoter sequences, ribosome 
binding sites, polyadenylation signals, transcription termination sequences, upstream 
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regulatory domains, enhancers, and the like, which collectively provide for the expression 
(i.e., the transcription and translation) of a coding sequence in a host cell. 

A control sequence "directs the expression" of a coding sequence in a cell when 
RNA polymerase will bind the promoter sequence and transcribe the coding sequence into 
5 mRNA, which is then translated into the polypeptide encoded by the coding sequence. 

A "host cell" is a cell which has been transformed or transfected, or is capable of 
transformation or transfection by an exogenous DNA sequence. 

A cell has been "transformed" by exogenous DNA when such exogenous DNA has 
been introduced inside the cell membrane. Exogenous DNA may or may not be integrated 
10 (covalently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an 
episomal element, such as a plasmid. With respect to eukaryotic cells, a stably transformed 
or transfected cell is one in which the exogenous DNA has become integrated into the 
chromosome so that it is inherited by daughter cells through chromosome replication. This 
15 stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones 
comprised of a population of daughter cell containing the exogenous DNA. 

A "clone" is a population of cells derived from a single cell or common ancestor by 
mitosis. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro 
for many generations. 

20 A "heterologous" region of a DNA construct is an identifiable segment of DNA 

within or attached to another DNA molecule that is not found in association with the other 
molecule in nature. 

Preparation of the RNase P protein component: 

The present invention also relates to vectors which include polynucleotides of the 
25 present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 

In accordance with yet a further aspect of the present invention, there is therefore 
provided a process for producing the polypeptide of the invention by recombinant 
techniques by expressing a polynucleotide encoding said polypeptide in a host and 
30 recovering the expressed product. Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 

Host cells are genetically engineered (transduced or transformed or transfected) 
with the vectors of this invention which may be, for example, a cloning vector or an 
expression vector. The vector may be, for example, in the form of a plasmid, a cosmid, a 
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phage, etc. The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters, selecting transformants or amplifying the 
genes. The culture conditions, such as temperature, pH and the like, are those previously 
used with the host cell selected for expression, and will be apparent to the ordinarily skilled 
5 artisan. 

Suitable expression vectors include chromosomal, nonchromosomal and synthetic 
DNA sequences, e.g., bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA. However, any other vector may 
be used as long as it is replicable and viable in the host. 

10 For recombinant production, host cells can be genetically engineered to incorporate 

expression systems or portions thereof or polynucleotides of the invention. Introduction of a 
polynucleotide into the host cell can be effected by methods described in many standard 
laboratory manuals, such as Davis et al, BASIC METHODS IN MOLECULAR BIOLOGY, 
(1986) and Sambrook et al, MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed., 

15 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989), such as, calcium 
phosphate transfection, DEAE-dextran mediated transfection, transvection, microinjection, 
cationic lipid-mediated transfection, electroporation, transduction, scrape loading, ballistic 
introduction and infection. 

Representative examples of appropriate hosts include bacterial cells, such as 

20 streptococci, staphylococci, enterococci E. coli, streptomyces and Bacillus subtilis cells; 
fungal cells, such as yeast cells and Aspergillus cells; insect cells such as Drosophila S2 and 
Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, 293 and 
Bowes melanoma cells; and plant cells. 

A great variety of expression systems can be used to produce the polypeptides of the 

25 invention. Such vectors include, among others, chromosomal, episomal and virus-derived 
vectors, e.g., vectors derived from bacterial plasmids, from bacteriophage, from transposons, 
from yeast episomes, from insertion elements, from yeast chromosomal elements, from 
viruses such as baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, 
fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from 

30 combinations thereof, such as those derived from plasmid and bacteriophage genetic 
elements, such as cosmids and phagemids. The expression system constructs may contain 
control regions that regulate as well as engender expression. Generally, any system or vector 
suitable to maintain, propagate or express polynucleotides and/or to express a polypeptide in a 
host may be used for expression in this regard. The appropriate DNA sequence may be 
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inserted into the expression system by any of a variety of well-known and routine techniques, 
such as, for example, those set forth in Sambrook et aL MOLECULAR CLONING, A 
LABORATORY MANUAL, {supra). 

The appropriate DNA sequence may be inserted into the vector by a variety of 
5 procedures. In general, the DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. 

The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E. coli. 
10 lac or trp, the phage lambda P L promoter and other promoters known to control expression 
of genes in eukaryotic or prokaryotic cells or their viruses. The expression vector may also 
contains a ribosome binding site for translation initiation and/or a transcription terminator. 
The vector may also include appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable 
15 marker genes to provide a phenotypic trait for selection of transformed host cells such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E. coli. 

The gene can be placed under the control of a promoter, ribosome binding site (for 
bacterial expression) and, optionally, an operator (collectively referred to herein as 
20 "control" elements), so that the DNA sequence encoding the desired protein is transcribed 
into RNA in the host cell transformed by a vector containing this expression construction. 
The coding sequence may or may not contain a signal peptide or leader sequence. The 
polypeptides of the present invention can be expressed using, for example, the E. coli tac 
promoter or the protein A gene (spa) promoter and signal sequence. Leader sequences can 
25 be removed by the bacterial host in post-translational processing. See, e.g., U.S. Patent 
Nos. 4,431,739; 4,425,437; 4,338,397. Promoter regions can be selected from any desired 
gene using CAT (chloramphenicol transferase) vectors or other vectors with selectable 
markers. Two appropriate vectors are PKK232-8 and PCM7. Particular named bacterial 
promoters include lad, lacZ, T3, T7, gpt, lambda P R , P L and trp. Eukaryotic promoters 
30 include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from 
retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and promoter is 
well within the level of ordinary skill in the art. 

In addition to control sequences, it may be desirable to add regulatory sequences 
which allow for regulation of the expression of the protein sequences relative to the growth 
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of the host cell. Regulatory sequences are known to those of skill in the art, and examples 
include those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of a regulatory compound. Other 
types of regulatory elements may also be present in the vector, for example, enhancer 
5 sequences. 

An expression vector is constructed so that the particular coding sequence is 
located in the vector with the appropriate regulatory sequences, the positioning and 
orientation of the coding sequence with respect to the control sequences being such that the 
coding sequence is transcribed under the "control" of the control sequences (i.e., RNA 
10 polymerase which binds to the DNA molecule at the control sequences transcribes the 
coding sequence). Modification of the coding sequences may be desirable to achieve this 
end. For example, in some cases it may be necessary to modify the sequence so that it may 
be attached to the control sequences with the appropriate orientation; i.e., to maintain the 
reading frame. The control sequences and other regulatory sequences may be ligated to the 
15 coding sequence prior to insertion into a vector, such as the cloning vectors described 
above. Alternatively, the coding sequence can be cloned directly into an expression vector 
which already contains the control sequences and an appropriate restriction site. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
20 gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly- 
expressed gene to direct transcription of a downstream structural sequence. The 
heterologous structural sequence is assembled in appropriate phase with translation 
initiation and termination sequences, and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic space or extracellular medium. 
25 Optionally, the heterologous sequence can encode a fusion protein including an N-terminal 
identification peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

The vector containing the appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be employed to transform an 
30 appropriate host to permit the host to express the protein. 

More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
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embodiment, the construct further comprises regulatory sequences, including, for example, 
a promoter, operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially available. The 
following vectors are provided by way of example. Bacterial: pET-3 vectors (Stratagene), 

5 pQE70, pQE60, pQE-9 (Qiagen), pbs, pDIO, phagescript, psiX174, pbluescript SK, pbsks, 
P NH8A, P NH16a, P NH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, 
pDR540, pRIT5 (Pharmacia). Eukaryotic: pBlueBacIII (Invitrogen), pWLNEO, 
pSV2CAT, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). 
However, any other plasmid or vector may be used as long as they are replicable and viable 

10 in the host. 

Examples of recombinant DNA vectors for cloning and host cells which they can 
transform include the bacteriophage 1 (£. coli), pBR322 (£. coli), pACYC177 (£. coli), 
pKT230 (gram-negative bacteria), pGV1106 (gram-negative bacteria), pLAFRl (gram- 
negative bacteria), pME290 (non-£. coli gram-negative bacteria), pHV14 (£. coli and 
15 Bacillus subtilis), pBD9 (Bacillus), pIJ61 (Streptomyces), pUC6 (Streptomyces), YIp5 
(Saccharomyces), a baculovirus insect cell system, YCpl9 (Saccharomyces). See, 
generally, "DNA Cloning": Vols. I & II, Glover et al. ed. IRL Press Oxford (1985) (1987) 
and; T. Maniatis et al. ("Molecular Cloning" Cold Spring Harbor Laboratory (1982). 

In some cases, it may be desirable to add sequences which cause the secretion of 
20 the polypeptide from the host organism, with subsequent cleavage of the secretory signal. 

Polypeptides can be expressed in host cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins 
using RNAs derived from the DNA constructs of the present invention. Appropriate 
cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described 
25 by Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold 
Spring Harbor, N.Y., (1989), the disclosure of which is hereby incorporated by reference. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. 
30 Cells are typically harvested by centrifugation, disrupted by physical or chemical 

means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or 
use of cell lysing agents, such methods are well known to those skilled in the art. 
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Depending on the expression system and host selected, the polypeptide of the 
present invention may be produced by growing host cells transformed by an expression 
vector described above under conditions whereby the polypeptide of interest is expressed. 
The polypeptide is then isolated from the host cells and purified. If the expression system 
5 secretes the polypeptide into growth media, the polypeptide can be purified directly from 
the media. If the polypeptide is not secreted, it is isolated from cell lysates or recovered 
from the cell membrane fraction. Where the polypeptide is localized to the cell surface, 
whole cells or isolated membranes can be used as an assayable source of the desired gene 
product. Polypeptide expressed in bacterial hosts such as E. coli may require isolation from 

10 inclusion bodies and refolding. Where the mature protein has a very hydrophobic region 
which leads to an insoluble product of overexpression, it may be desirable to express a 
truncated protein in which the hydrophobic region has been deleted. The selection of the 
appropriate growth conditions and recovery methods are within the skill of the art. 

The polypeptide can be recovered and purified from recombinant cell cultures by 

15 methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 

20 (HPLC) can be employed for final purification steps. 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial methionine amino acid residue. 
Preparation of the RNase P RNA Component: 

25 The RNase P RNA molecules are prepared by run-off in vitro transcription using 

T7 RNA polymerase as according to standard conditions - usually as recommended by the 
supplier, e.g., Promega. The plasmid is linearized with an appropriate restriction enzyme 
generating a linear dsDNA comprising the full length gene encoding the RNase PRNA. 
The RNA is purified either from a preparative denaturing acrylamide gel or is precipitated 

30 prior to use in in vitro cleavage assays. The substrates for the RNase P RNA and the RNA 
complexed with its protein (RNase P protein) can be obtained by in vitro transcription of 
cloned genes. Useful substrates included but are not limited to pre-tRNA^et or £• co // or 
B. subtilis pre-4.5S molecules and may be expressed using an in vitro transcription system 
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directed by T7 RNA polymerase as described above. The RNA can also be prepared by 
automated synthesis. 

Antagonists and agonists - assays and molecules 

This invention provides a method of screening drugs to identify those which 
5 interfere with the RNA portion, the protein portion and/ or the intact RNA/protein complex 
of the RNase P described herein, which method comprises measuring the interference of the 
activity of the protein and/or RNA by a test drug. For example since the RNA portion 
selected has a catalytic activity, after suitable purification and formulation the activity of 
the RNA can be followed by its ability to convert its natural or synthetic RNA substrates. 
10 By incorporating different chemically synthesized test compounds or natural products into 
such an assay of enzymatic activity one is able to detect those additives which compete 
with the natural or synthetic substrate or otherwise inhibit enzymatic activity. 

Polypeptides of the invention may also be used to assess the binding of small 
molecule substrates and ligands in, for example, cells, cell-free preparations, chemical 
15 libraries, and natural product mixtures. These substrates and ligands may be natural substrates 
and ligands or may be structural or functional mimetics. See, e.g., Coligan et al, Current 
Protocols in Immunology 1(2): Chapter 5 (1991). 

The invention also provides a method of screening compounds to identify those 
which enhance (agonist) or block (antagonist) the action of RNaseP polypeptides or 
20 polynucleotides, particularly those compounds that are bacteriostatic and/or bacteriocidal. 
The method of screening may involve high-throughput techniques. For example, to screen for 
agonists or antagonists, a synthetic reaction mix, a cellular compartment, such as a membrane, 
cell envelope or cell wall, or a preparation of any thereof, comprising RNaseP polypeptide 
and a labeled substrate or ligand of such polypeptide is incubated in the absence or the 
25 presence of a candidate molecule that may be a RNaseP agonist or antagonist. The ability of 
the candidate molecule to agonize or antagonize the RNaseP polypeptide is reflected in 
decreased binding of the labeled ligand or decreased production of product from such 
substrate. Molecules that bind gratuitously, i.e., without inducing the effects of RNaseP 
polypeptide are most likely to be good antagonists. Molecules that bind well and increase the 
30 rate of product production from substrate are agonists. Detection of the rate or level of 
production of product from substrate may be enhanced by using a reporter system. Reporter 
systems that may be useful in this regard include but are not limited to colorimetric labeled 
substrate converted into product, a reporter gene that is responsive to changes in RNaseP 
polynucleotide or polypeptide activity, and binding assays known in the art. 
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Another example of an assay for RNaseP antagonists is a competitive assay that 
combines RNaseP and a potential antagonist with RNaseP-binding molecules, recombinant 
RNaseP binding molecules, natural substrates or ligands, or substrate or ligand mimetics, 
under appropriate conditions for a competitive inhibition assay. RNaseP can be labeled, such 
5 as by radioactivity or a colorimetric compound, such that the number of RNaseP molecules 
bound to a binding molecule or converted to product can be determined accurately to assess 
the effectiveness of the potential antagonist. 

Potential antagonists include small organic molecules, peptides, polypeptides and 
antibodies that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or 

10 extinguish its activity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds the same sites on a binding 
molecule, such as a binding molecule, without inducing RNaseP-induced activities, thereby 
preventing the action of RNaseP by excluding RNaseP from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding 

15 site of the polypeptide thereby preventing binding to cellular binding molecules, such that 
normal biological activity is prevented. Examples of small molecules include but are not 
limited to small organic molecules, peptides or peptide-like molecules. Other potential 
antagonists include antisense molecules (see Okano, J. Neurochem. 56: 560 (1991); 
OLIGODEOXYNU CLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, 

20 CRC Press, Boca Raton, FL (1988), for a description of these molecules). Preferred potential 
antagonists include compounds related to and variants of RNaseP. 

Each of the DNA sequences provided herein may be used in the discovery and 
development of antibacterial compounds. The encoded protein, upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the DNA sequences 

25 encoding the amino terminal regions of the encoded protein or Shine-Delgarno or other 
translation facilitating sequences of the respective mRNA can be used to construct 
antisense sequences to control the expression of the coding sequence of interest. 

The invention also provides the use of the polypeptide, polynucleotide or inhibitor 
of the invention to interfere with the initial physical interaction between a pathogen and 

30 mammalian host responsible for sequelae of infection. In particular the molecules of the 
invention may be used: in the prevention of adhesion of bacteria, in particular gram positive 
bacteria, to mammalian extracellular matrix proteins on in-dwelling devices or to 
extracellular matrix proteins in wounds; to block RNaseP protein-mediated mammalian cell 
invasion by, for example, initiating phosphorylation of mammalian tyrosine kinases 
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(Rosenshine et al, Infect. Immun. 60:2211 (1992); to block bacterial adhesion between 
mammalian extracellular matrix proteins and bacterial RNaseP proteins that mediate tissue 
damage and; to block the normal progression of pathogenesis in infections initiated other 
than by the implantation of in-dwelling devices or by other surgical techniques. 
5 The antagonists and agonists of the invention may be employed, for instance, to 

inhibit and treat disease, such as, infections of the upper respiratory tract (e.g., otitis media, 
bacterial tracheitis, acute epiglottitis, thyroiditis), lower respiratory (e.g., empyema, lung 
abscess), cardiac (e.g., infective endocarditis), gastrointestinal (e.g., secretory diarrhoea, 
splenic absces, retroperitoneal abscess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, 

10 conjunctivitis, keratitis, endophthalmitis, preseptal and orbital cellulitis, darcryocystitis), 
kidney and urinary tract (e.g., epididymitis, intrarenal and perinephric absces, toxic shock 
syndrome), skin (e.g., impetigo, folliculitis, cutaneous abscesses, cellulitis, wound infection, 
bacterial myositis) bone and joint (e.g., septic arthritis, osteomyelitis). 
HTP Screening Strategies: 

15 Assays can be developed to detect compounds that inhibit RNase P directed 

cleavage of RNA substrates. Several possible assay formats are suitable for HTP screening 
based upon the ability to incorporate labels within the RNA in a site-specific fashion by 
chemical synthesis. The conventional radioactive-based format is preferred, while a 
homogeneous fluorescence-based format is useful for subsequent follow-up of lead 

20 compounds. The use of both formats is contemplated by this invention. 
Functional RNase P Assay 

Biotin is introduced in to an appropriate position within the RNA substrate and the 
5' terminus labeled with 3 2p. The substrate is linked via streptavidin within a 96-well plate. 
Following RNase P dependent hydrolysis of the substrate, the radiolabeled 5' leader 

25 cleavage product is released into the bulk solution phase, and subject to scintillation 
counting. Alternatively, the RNA substrate is bound to a streptavidin-coated flashplate such 
that the release of the radioactive 6-mer into the solution phase results in a decrease in 
signal. This has the advantage that it is a homogeneous, continuous assay format and 
requires no additional manipulations after starting the assay. Both formats are useful in the 

30 practice of this invention because they use the same RNA substrate. 
RNA Fragment Library Rescue 

An effective approach for identifying compounds that interact with RNA is 
contemplated. The concept is based on the over-expression of a drug binding site that is 
recreated on an RNA fragment, which will sequester the drug and permit the continued 
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functioning of the intact ribozyme. This approach has recently been described in the context 
of a search for ligands that bind ribosomal RNA (Howard, B-A, et al., Biochem. Cell Bio. 
73(11/12): 1161-1166 (1995)). Following selection the random RNA fragments that 
apparently present a minimal target structure for drug recognition, are incorporated into a 
5 protocol for rational drug design. Accordingly, random fragment libraries based on Ml 
RNA will be generated and used in HTP screening to identify compounds that disrupt 
RNA/protein interaction. 
Cyclic peptide phage libraries: 

The incorporation of conformational constraints into flexible lead compounds is a 
10 powerful strategy to increase lead potency and is particularly useful in the field of 
peptidomimetic design.( Al-Obeidi, F. et al., J. Med. Chem . 32: 2555-2561(1989);Barker, 
P.L., et al., J. Med. Chem. 35: 2040-2048 (1992)). Cyclization has been shown to increase 
the propensity for beta-turn formation in peptides the potential of which has been 
demonstrated by the identification of high-affinity ligands for streptavidin (Lee, M. S., et 
15 al. FEBS Lett . 359: 1 13-118 (1995)). 

In this case, cyclic peptide libraries were constructed with flanking cysteine residues to 
allow efficient disulfide bond formation and cyclization during phage assembly. The 
streptavidin bound crystal structures of two disulfide bridged cyclic peptides showed both 
peptides to be in beta-turn conformations (Kahn, M. (Guest, Ed., 1993) Tetrahedron 49, 
20 Symp. 50, 3433-3677). 

Beta-turns are key recognition elements in many biological interactions therefore 
effort has been focused on the design of small constrained beta-turn mimics (Kahn, M. 
(Guest Ed., 1993) Tetrahedron 49 , Symp. 50, 3433-3677). This approach, when applied to 
RNase P, could identify cyclic peptides suitable for peptide mimic synthesis as inhibitor 
25 molecules. A cyclic octapeptide phage display library may be constructed and used to 
identify peptides that interact with defined RNA domains. 
Secondary Evaluation 

Systematic Evolution of Ligands by Exponentia l Enrichment: 

This approach may be employed in an attempt to identify RNA recognition 
30 motifs for RNase P RNA (Ml RNA) protein binding for structural analysis as an aid to 
rational drug design and the secondary evaluation of compounds identified via the HTP 
screens. The technology is based on the repeated selection and amplification of RNA 
fragments that specifically bind to a protein with high affinities (Szostak, J.W., TJBS 17: 
89-93 (1992)). Fragment libraries based on the S. aureus and E. coli RNase P RNAs may 
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be constructed for the in vitro synthesis of RNA fragments and the subsequent selection of 
molecules that bind their respective proteins. Chemical and enzymatic structure probing 
technologies may be employed in combination with protein/RNA protection studies to map 
the interactive sites. SELEX based on the resulting RNA fragment(s) may be further 
5 exploited to determine the minimal structural requirements for RNA recognition. 
Disruption of RNase P Assembly 

The identification of a protein/RNA-fragment pair permits the development of a 
screen for compounds that disrupt their assembly. Drug induced disruption of labeled RNA 
bound to immobilized protein (biotin/streptavidin) would result in the concomitant 

10 decrease/loss of the signal generated by the presence of the RNA. 
RN A/Drug Interactions 

RNase P RNA fragments that confer drug resistance (RNA Fragment Rescue 
Library supra) may be sequenced and expressed in vitro for chemical and enzymatic 
structure probing in the presence and absence of the drug in an attempt to map the binding 

15 site. SELEX may be applied to lead compounds in an attempt to identify the minimal 
structural requirements for drug binding. 
RNase P Substrates 

Minimal RNA substrates may be chemically synthesized for HTP screening 
including both pre-tRNA and pre-4.5S RNA derivatives. RNA-Iigand interactions involving 

20 ribose 2-hydroxyl groups of specific nucleotides may be probed via chemical synthesis of 
the appropriately modified RNA fragment. In order to retain the C-endo configuration 
characteristic of ribonucleotides, 2 -methoxy and 2'-fluororibonucleotides analogues can be 
used, the latter being preferred on steric grounds. Nucleotides lacking a 2'-substituent adopt 
the undesired C 2 -endo configuration. 

25 The invention also relates to inhibitors identified by any of the techniques 

described herein. Because of the enzymatic nature of RNase P action, it is appreciated that 
inhibitors may be identified which act as transition state mimics, inhibitors of product 
release or inhibitors of substrate binding. 
Diagnostic Assays 

30 This invention is also related to the use of the RNaseP polynucleotides of the 

invention for use as diagnostic reagents. Detection of RNaseP in a eukaryote, particularly a 
mammal, and especially a human, will provide a diagnostic method for diagnosis of a disease. 
Eukaryotes (herein also "individual(s)"), particularly mammals, and especially humans, 
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infected with an organism comprising the RNaseP gene may be detected at the nucleic acid 
level by a variety of techniques. 

Nucleic acids for diagnosis may be obtained from an infected individual's cells and 
tissues, such as bone, blood, muscle, cartilage, and skin. Genomic DNA may be used directly 
5 for detection or may be amplified enzymatically by using PCR or other amplification 
technique prior to analysis. RNA or cDNA may also be used in the same ways. Using 
amplification, characterization of the species and strain of prokaryote present in an individual, 
may be made by an analysis of the genotype of the prokaryote gene. Deletions and insertions 
can be detected by a change in size of the amplified product in comparison to the genotype of 

10 a reference sequence. Point mutations can be identified by hybridizing amplified DNA to 
labeled RNaseP polynucleotide sequences. Perfectly matched sequences can be distinguished 
from mismatched duplexes by RNase digestion or by differences in melting temperatures. 
DNA sequence differences may also be detected by alterations in the electrophoretic mobility 
of the DNA fragments in gels, with or without denaturing agents, or by direct DNA 

15 sequencing. See, e.g., Myers et aL, Science, 230: 1242 (1985). Sequence changes at specific 
locations also may be revealed by nuclease protection assays, such as RNase and SI 
protection or a chemical cleavage method. See, e.g., Cotton et aL, Proc. Natl. Acad. ScL, 
USA, 85: 4397-4401 
(1985). 

20 Cells carrying mutations or polymorphisms in the gene of the invention may also be 

detected at the DNA level by a variety of techniques, to allow for serotyping, for example. 
For example, RT-PCR can be used to detect mutations. It is particularly preferred to used RT- 
PCR in conjunction with automated detection systems, such as, for example, GeneScan. RNA 
or cDNA may also be used for the same purpose, PCR or RT-PCR. As an example, PCR 

25 primers complementary to a nucleic acid encoding RNaseP can be used to identify and 
analyze mutations. Examples of representative primers are shown in the Examples. The 
invention further provides these primers with 1, 2, 3 or 4 nucleotides removed from the 5' 
and/or the 3' end. These primers may be used for, among other things, amplifying RNaseP 
DNA isolated from a sample derived from an individual. The primers may be used to amplify 

30 the gene isolated from an infected individual such that the gene may then be subject to various 
techniques for elucidation of the DNA sequence. In this way, mutations in the DNA sequence 
may be detected and used to diagnose infection and to serotype and/or classify the infectious 
agent. 



-35- 



WO 00/70025 



PCT/US00/13946 



The invention further provides a process for diagnosing, disease, preferably bacterial 
infections, more preferably infections by Staphylococcus aureus, and most preferably disease, 
such as, infections of the upper respiratory tract (e.g., otitis media, bacterial tracheitis, acute 
epiglottitis, thyroiditis), lower respiratory (e.g., empyema, lung abscess), cardiac (e.g., 
5 infective endocarditis), gastrointestinal (e.g., secretory diarrhoea, splenic absces, 
retroperitoneal abscess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, conjunctivitis, 
keratitis, endophthalmitis, preseptal and orbital cellulitis, darcryocystitis), kidney and urinary 
tract (e.g., epididymitis, intrarenal and perinephric absces, toxic shock syndrome), skin (e.g., 
impetigo, folliculitis, cutaneous abscesses, cellulitis, wound infection, bacterial myositis) bone 
1 0 and joint (e.g., septic arthritis, osteomyelitis), comprising determining from a sample derived 
from an individual a increased level of expression of polynucleotide having the sequence of 
Table 1 [SEQ ID NO: 1]. Increased or decreased expression of RNaseP polynucleotide can 
be measured using any on of the methods well known in the art for the quantitation of 
polynucleotides, such as, for example, amplification, PCR, RT-PCR, RNase protection, 
1 5 Northern blotting and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over- 
expression of RNaseP protein compared to normal control tissue samples may be used to 
detect the presence of an infection, for example. Assay techniques that can be used to 
determine levels of a RNaseP protein, in a sample derived from a host are well-known to 
20 those of skill in the art. Such assay methods include radioimmunoassays, competitive-binding 
assays, Western Blot analysis and ELISA assays. 
Antibodies 

The polypeptides of the invention or variants thereof, or cells expressing them can be 
used as an immunogen to produce antibodies immunospecific for such polypeptides. 
25 "Antibodies" as used herein includes monoclonal and polyclonal antibodies, chimeric, single 
chain, simianized antibodies and humanized antibodies, as well as Fab fragments, including 
the products of an Fab immunolglobulin expression library. 

The Fab fragment may also be prepared from its parent monoclonal antibody by 
enzyme treatment, for example using papain to cleave the Fab portion from the Fc portion. 
30 Antibodies generated against the polypeptides of the invention can be obtained by 

administering the polypeptides or epitope-bearing fragments, analogues or cells to an animal, 
preferably a nonhuman, using routine protocols. The antibody so obtained will then bind the 
polypeptides itself. In this manner, even a sequence encoding only a fragment of the 
polypeptides can be used to generate antibodies binding the whole native polypeptides. 
-36- 



WO 00/70025 



PCT/US00/13946 



Such antibodies can then be used to isolate the polypeptide from tissue expressing that 
polypeptide. For preparation of monoclonal antibodies, any technique known in the art that 
provides antibodies produced by continuous cell line cultures can be used. Examples include 
various techniques, such as those in Kohler, G. and Milstein, C, Nature 256: 495-497 (1975); 

5 Kozbor et al, Immunology Today 4: 72 (1983); Cole et al., pg. 77-96 in MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc. (1985), and the EBV-hybridoma 
technique to produce human monoclonal antibodies (Cole, et al., 1985, in Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

The hybridomas are screened to select a cell line with high binding affinity and 

0 favorable cross reaction with other staphylococcal species using one or more of the original 
polypeptide and/or the fusion protein. The selected cell line is cultured to obtain the 
desired Mab. 

Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) 
can be adapted to produce single chain antibodies to polypeptides of this invention. Also, 
5 transgenic mice, or other organisms such as other mammals, may be used to express 
humanized antibodies. 

Alternatively phage display technology may be utilized to select antibody genes 
with binding activities towards the polypeptide either from repertoires of PCR amplified v- 
genes of lymphocytes from humans screened for possessing anti-RNaseP or from naive 
:0 libraries (McCafferty, J. et al., (1990), Nature 348, 552-554; Marks, J. et al., (1992) 
Biotechnology 10, 779-783). The affinity of these antibodies can also be improved by 
chain shuffling (Clackson, T. et al., (1991) Nature 352, 624-628). 

The antibody should be screened again for high affinity to the polypeptide and/or 
fusion protein. 

15 As mentioned above, a fragment of the final antibody may be prepared. 

The antibody may be either intact antibody of M r approx 150,000 or a derivative of 
it, for example a Fab fragment or a Fv fragment as described in Skerra, A and Pluckthun, 
A., Science 240:1038-1040 (1988). If two antigen binding domains are present each 
domain may be directed against a different epitope - termed 'bispecific' antibodies. 
50 If two antigen binding domains are present each domain may be directed against a 

different epitope - termed 'bispecific' antibodies. 

In particular derivatives which are slightly longer or slightly shorter than the native 
protein or polypeptide fragment of the present invention may be used. In addition, 
polypeptides in which one or more of the amino acid residues are modified may be used. 
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Such peptides may, for example, be prepared by substitution, addition, or rearrangement of 
amino acids or by chemical modification thereof. All such substitutions and modifications 
are generally well known to those skilled in the art of peptide chemistry. 

The above-described antibodies may be employed to isolate or to identify clones 
5 expressing the polypeptides to purify the polypeptides by affinity chromatography. 

Thus, among others, antibodies against RNaseP- polypeptide may be employed to 
treat infections, particularly bacterial infections and especially disease, such as, infections of 
the upper respiratory tract (e.g., otitis media, bacterial tracheitis, acute epiglottitis, thyroiditis), 
lower respiratory (e.g., empyema, lung abscess), cardiac (e.g., infective endocarditis), 

10 gastrointestinal (e.g., secretory diarrhoea, splenic absces, retroperitoneal abscess), CNS (e.g., 
cerebral abscess), eye (e.g., blepharitis, conjunctivitis, keratitis, endophthalmitis, preseptal and 
orbital cellulitis, darcryocystitis), kidney and urinary tract (e.g., epididymitis, intrarenal and 
perinephric absces, toxic shock syndrome), skin (e.g., impetigo, folliculitis, cutaneous 
abscesses, cellulitis, wound infection, bacterial myositis) bone and joint (e.g., septic arthritis, 

15 osteomyelitis). 

Preferably the antibody is prepared by expression of a DNA polymer encoding said 
antibody in an appropriate expression system such as described above for the expression of 
polypeptides of the invention. The choice of vector for the expression system will be 
determined in part by the host, which may be a prokaryotic cell, such as E. coli (preferably 

20 strain B) or Streptomyces sp. or a eukaryotic cell, such as a mouse CI 27, mouse myeloma, 
human HeLa, Chinese hamster ovary, filamentous or unicellular fungi or insect cell. The 
host may also be a transgenic animal or a transgenic plant (for example, as described in 
Hiatt, A. et al, Nature 340:76-78(1989). Suitable vectors include plasmids, 
bacteriophages, cosmids and recombinant viruses, derived from, for example, baculoviruses 

25 and vaccinia. 

Polypeptide variants include antigenically, epitopically or immunologically 
equivalent variants that form a particular aspect of this invention. The term "antigenically 
equivalent derivative" as used herein encompasses a polypeptide or its equivalent which 
will be specifically recognized by certain antibodies which, when raised to the protein or 
30 polypeptide according to the invention, interfere with the immediate physical interaction 
between pathogen and mammalian host. The term "immunologically equivalent derivative" 
as used herein encompasses a peptide or its equivalent which when used in a suitable 
formulation to raise antibodies in a vertebrate, the antibodies act to interfere with the 
immediate physical interaction between pathogen and mammalian host. 
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The polypeptide, such as an antigenically or immunologically equivalent derivative 
or a fusion protein thereof is used as an antigen to immunize a mouse or other animal such 
as a rat or chicken. The fusion protein may provide stability to the polypeptide. The 
antigen may be associated, for example by conjugation, with an immunogenic carrier 

5 protein for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). 
Alternatively a multiple antigenic peptide comprising multiple copies of the protein or 
polypeptide, or an antigenically or immunologically equivalent polypeptide thereof may be 
sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier. 

Preferably, the antibody or variant thereof is modified to make it less immunogenic 

0 in the individual. For example, if the individual is human the antibody may most 
preferably be "humanized"; where the complimentarity determining region(s) of the 
hybridoma-derived antibody has been transplanted into a human monoclonal antibody , for 
example as described in Jones, P. et al. (1986), Nature 321, 522-525 or Tempest et al., 
(1991) Biotechnology 9, 266-273. The humanized monoclonal antibody, or its fragment 

5 having binding activity, form a particular aspect of this invention. 

The modification need not be restricted to one of "humanization"; other primate 
sequences (for example Newman, R. ex al, Biotechnology 10:1455-1460 (1992)) may also 
be used. 

The use of a polynucleotide of the invention in genetic immunization will 
10 preferably employ a suitable delivery method such as direct injection of plasmid DNA into 

muscles (Wolff et al., Hum Mol Genet 1992, 1:363, Manthorpe et al., Hum. Gene Ther. 

1963:4, 419), delivery of DNA complexed with specific protein carriers (Wu et al., J Biol 

Chem. 1989: 264,16985), coprecipitation of DNA with calcium phosphate (Benvenisty & 

Reshef, PNAS USA, 1986:83,9551), encapsulation of DNA in various forms of liposomes 
15 (Kaneda et al., Science 1989:243,375), particle bombardment (Tang et al.. Nature 1992, 

356:152, Eisenbraun et al, DNA Cell Biol 1993, 12:791) and in vivo infection using cloned 

retroviral vectors (Seeger et al., PNAS USA 1984:81,5849). 

Vaccines 

Another aspect of the invention relates to a method for inducing an immunological 
50 response in an individual, particularly a mammal which comprises inoculating the 
individual with RNaseP, or a fragment or variant thereof, adequate to produce antibody and/ 
or T cell immune response to protect said individual from infection, particularly bacterial 
infection and most particularly Staphylococcus aureus infection. Also provided are 
methods whereby such immunological response slows bacterial replication. Yet another 
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aspect of the invention relates to a method of inducing immunological response in an 
individual which comprises delivering to such individual a nucleic acid vector to direct 
expression of RNaseP, or a fragment or a variant thereof, for expressing RNaseP, or a 
fragment or a variant thereof in vivo in order to induce an immunological response, such as, 
5 to produce antibody and/ or T cell immune response, including, for example, cytokine- 
producing T cells or cytotoxic T cells, to protect said individual from disease, whether that 
disease is already established within the individual or not. One way of administering the 
gene is by accelerating it into the desired cells as a coating on particles or otherwise. 
Such nucleic acid vector may comprise DNA, RNA, a modified nucleic acid, or a 

10 DNA/RNA hybrid. 

A further aspect of the invention relates to an immunological composition which, 
when introduced into an individual capable or having induced within it an immunological 
response, induces an immunological response in such individual to a RNaseP or protein 
coded therefrom, wherein the composition comprises a recombinant RNaseP or protein 

15 coded therefrom comprising DNA which codes for and expresses an antigen of said 
RNaseP or protein coded therefrom. The immunological response may be used 
therapeutically or prophylactically and may take the form of antibody immunity or cellular 
immunity such as that arising from CTL or CD4+ T cells. 

A RNaseP polypeptide or a fragment thereof may be fused with co-protein which 

20 may not by itself produce antibodies, but is capable of stabilizing the first protein and 
producing a fused protein which will have immunogenic and protective properties. Thus 
fused recombinant protein, preferably further comprises an antigenic co-protein, such as 
lipoprotein D from Hemophilus influenzae, Glutathione-S-transferase (GST) or beta- 
galactosidase, relatively large co-proteins which solubilize the protein and facilitate 

25 production and purification thereof. Moreover, the co-protein may act as an adjuvant in the 
sense of providing a generalized stimulation of the immune system. The co-protein may be 
attached to either the amino or carboxy terminus of the first protein. 

Provided by this invention are compositions, particularly vaccine compositions, and 
methods comprising the polypeptides or polynucleotides of the invention and 

30 immunostimulatory DNA sequences, such as those described in Sato, Y. et al. Science 273: 
352 (1996). 

Also, provided by this invention are methods using the described polynucleotide or 
particular fragments thereof which have been shown to encode non-variable regions of 
bacterial cell surface proteins in DNA constructs used in such genetic immunization 
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experiments in animal models of infection with Staphylococcus aureus will be particularly 
useful for identifying protein epitopes able to provoke a prophylactic or therapeutic 
immune response. It is believed that this approach will allow for the subsequent 
preparation of monoclonal antibodies of particular value from the requisite organ of the 
5 animal successfully resisting or clearing infection for the development of prophylactic 
agents or therapeutic treatments of bacterial infection, particularly Staphylococcus aureus 
infection, in mammals, particularly humans. 

The polypeptide may be used as an antigen for vaccination of a host to produce 
specific antibodies which protect against invasion of bacteria, for example by blocking 

10 adherence of bacteria to damaged tissue. Examples of tissue damage include wounds in 
skin or connective tissue caused, e.g., by mechanical, chemical or thermal damage or by 
implantation of indwelling devices, or wounds in the mucous membranes, such as the 
mouth, mammary glands, urethra or vagina. 

The invention also includes a vaccine formulation which comprises an 

1 5 immunogenic recombinant protein of the invention together with a suitable carrier. Since 
the protein may be broken down in the stomach, it is preferably administered parenterally, 
including, for example, administration that is subcutaneous, intramuscular, intravenous, or 
intradermal. Formulations suitable for parenteral administration include aqueous and non- 
aqueous sterile injection solutions which may contain anti-oxidants, buffers, bacteriostats 

20 and solutes which render the formulation insotonic with the bodily fluid, preferably the 
blood, of the individual; and aqueous and non-aqueous sterile suspensions which may 
include suspending agents or thickening agents. The formulations may be presented in 
unit-dose or multi-dose containers, for example, sealed ampules and vials and may be 
stored in a freeze-dried condition requiring only the addition of the sterile liquid carrier 

25 immediately prior to use. The vaccine formulation may also include adjuvant systems for 
enhancing the immunogenicity of the formulation, such as oil-in water systems and other 
systems known in the art. The dosage will depend on the specific activity of the vaccine 
and can be readily determined by routine experimentation. 

While the invention has been described with reference to certain RNaseP protein, it 

30 is to be understood that this covers fragments of the naturally occurring protein and similar 
proteins with additions, deletions or substitutions which do not substantially affect the 
immunogenic properties of the recombinant protein. 



-41 - 



WO 00/70025 



PCT/US00/13946 



Compositions, kits and administration 

The invention also relates to compositions comprising the polynucleotide or the 
polypeptides discussed above or their agonists or antagonists. The polypeptides of the 
invention may be employed in combination with a non-sterile or sterile carrier or carriers for 
5 use with cells, tissues or organisms, such as a pharmaceutical carrier suitable for 
administration to a subject. Such compositions comprise, for instance, a media additive or a 
therapeutically effective amount of a polypeptide of the invention and a pharmaceutically 
acceptable carrier or excipient. Such carriers may include, but are not limited to, saline, 
buffered saline, dextrose, water, glycerol, ethanol and combinations thereof. The formulation 

10 should suit the mode of administration. The invention further relates to diagnostic and 
pharmaceutical packs and kits comprising one or more containers filled with one or more of 
the ingredients of the aforementioned compositions of the invention. 

Polypeptides and other compounds of the invention may be employed alone or in 
conjunction with other compounds, such as therapeutic compounds. 

15 The pharmaceutical compositions may be administered in any effective, convenient 

manner including, for instance, administration by topical, oral, anal, vaginal, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes among others. 

In therapy or as a prophylactic, the active agent may be administered to an 
individual as an injectable composition, for example as a sterile aqueous dispersion, 

20 preferably isotonic. 

Alternatively the composition may be formulated for topical application 
for example in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain appropriate 
conventional additives, including, for example, preservatives, solvents to assist drug 

25 penetration, and emollients in ointments and creams. Such topical formulations may also 
contain compatible conventional carriers, for example cream or ointment bases, and ethanol 
or oleyl alcohol for lotions. Such carriers may constitute from about 1 % to about 98% by 
weight of the formulation; more usually they will constitute up to about 80% by weight of 
the formulation. 

30 For administration to mammals, and particularly humans, it is expected that the 

daily dosage level of the active agent will be from 0.01 mg/kg to 10 mg/kg, typically 
around 1 mg/kg. The physician in any event will determine the actual dosage which will be 
most suitable for an individual and will vary with the age, weight and response of the 
particular individual. The above dosages are exemplary of the average case. There can, of 
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course, be individual instances where higher or lower dosage ranges are merited, and such 
are within the scope of this invention. 

In-dwelling devices include surgical implants, prosthetic devices and catheters, i.e., 
devices that are introduced to the body of an individual and remain in position for an 
5 extended time. Such devices include, for example, artificial joints, heart valves, 
pacemakers, vascular grafts, vascular catheters, cerebrospinal fluid shunts, urinary 
catheters, continuous ambulatory peritoneal dialysis (CAPD) catheters. 

The composition of the invention may be administered by injection to achieve a 
systemic effect against relevant bacteria shortly before insertion of an in-dwelling device. 
10 Treatment may be continued after surgery during the in-body time of the device. In 
addition, the composition could also be used to broaden perioperative cover for any surgical 
technique to prevent bacterial wound infections, especially Staphylococcus aureus wound 
infections. 

Many orthopaedic surgeons consider that humans with prosthetic joints should be 
15 considered for antibiotic prophylaxis before dental treatment that could produce a 
bacteremia. Late deep infection is a serious complication sometimes leading to loss of the 
prosthetic joint and is accompanied by significant morbidity and mortality. It may 
therefore be possible to extend the use of the active agent as a replacement for prophylactic 
antibiotics in this situation. 
20 In addition to the therapy described above, the compositions of this invention may 

be used generally as a wound treatment agent to prevent adhesion of bacteria to matrix 
proteins exposed in wound tissue and for prophylactic use in dental treatment as an 
alternative to, or in conjunction with, antibiotic prophylaxis. 

Alternatively, the composition of the invention may be used to bathe an indwelling device 
25 immediately before insertion. The active agent will preferably be present at a concentration 

of lug/ml to lOmg/ml for bathing of wounds or indwelling devices. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants 

may be employed to enhance the immune response. A suitable unit dose for vaccination is 

0.5-5 microgram/kg of antigen, and such dose is preferably administered 1-3 times and 
30 with an interval of 1-3 weeks. With the indicated dose range, no adverse toxicological 

effects will be observed with the compounds of the invention which would preclude their 

administration to suitable individuals. 
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Each reference disclosed herein is incorporated by reference herein in its entirety. 
Any patent application to which this application claims priority is also incorporated by 
reference herein in its entirety. 
Examples 

5 The present invention is further described by the following examples. The examples 

are provided solely to illustrate the invention by reference to specific embodiments. These 
exemplifications, while illustrating certain specific aspects of the invention, do not portray the 
limitations or circumscribe the scope of the disclosed invention. 

All examples were carried out using standard techniques, which are well known and 

0 routine to those of skill in the art, except where otherwise described in detail. Routine 
molecular biology techniques of the following examples can be carried out as described in 
standard laboratory manuals, such as Sambrook et al., MOLECULAR CLONING: A 
LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y.(1989). 

5 All parts or amounts set out in the following examples are by weight, unless 

otherwise specified. 

Unless otherwise stated size separation of fragments in the examples below was 
carried out using standard techniques of agarose and polyacrylamide gel electrophoresis 
("PAGE") in Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd 
;0 Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989) and numerous 
other references such as, for instance, by Goeddel et al., Nucleic Acids Res. 8: 4057 ( 1 980). 
Example 1. PCR with random primers 

This method describes a rapid way to obtain additional sequence data from partial 
gene fragments independent of probing a genomic library. 
15 Random sequencing of a S. aureus genomic library followed by sequence 

homology searching with the B. subtilis P-protein sequence resulted in the identification of 
a 324 base pair (herein "bp") fragment. The first 193 nucleotides of this fragment showed 
significant homology to the C-terminal half of the B. subtilis P-protein and other 
prokaryotic RNase P proteins. The N-terminal domain of the putative S. aureus RNase P 
(0 protein was putatively missing. 

Deduced S. aureus RNase P protein sequence versus B. subtilis shows 58.7% 
similarity, 34.9% identity. 

A novel two step PCR with two different reverse primers complementary to the 
known sequence in position 15-39 and 194-215 respectively (shown below, Table 2) as well 
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as random hexamer primers (Gibco) was used to obtain the complete 5" sequence of the S. 
aureus spp gene. S. aureus genomic DNA partially digested with Hind III or Pst I served as 
a template. Primer #1 (position 194-215) annealed just downstream of the stop codon TAA 
and primer #2 (position 15-39) close to the end of the known sequence (see Table 2). 
5 Table 2 

Primer Position 

5' GAATT CCGTG TTAAG AAACA AGATT AAAAG AGCAA TACGT GAAAA TTTCA AAGTA CATAA 
GTCGN 

3'C TTTGT TCTAA TTTTC TCGTT ATGC 5' [SEQ ID NO : 7 ] 
10 ATATA— TTGGN CAAAG ATATT ATTGN TATAG NAAGA CAGGC AGCTA AAGAT ATGAC GACTT 
TACAA 

ATACA GRATA GTCTT GAGCA CGTNC TTAAA ATTGG CAAAG TTTTT AATAA AAAGA TTAAG 
TAAGG 

15 

3 'CC 

ATAGG GTAGG GAATG AAAAC ATTAA CCCCT CAACG CATCC CGAAG TCTTA CCTCA GACAA 
ACGTT 

TATCC CATCC CTTAC TTTTG 5' [SEQ ID NO: 8] 
20 AGACT GACCC TAGGG TTAAG ACTTG GCCCN AGGGN TATNN TAACT TACTT TAAAA TGTTT 
TCAC 3' [SEQ ID NO: 4] 

In the first step, the reverse strand encoding the C-terminus and upstream the 
unknown N-terminal sequence of the protein was amplified using primer #1, resulting in 
25 single stranded products. This amplification of the desired sequence favoured the binding 
of random primers to that sequence in the next step. 

In the second step, 32p. e nd-labeled primer #2 and random hexamers were added 
and the annealing temperature was dropped to 25°C to allow the short random primers to 
anneal to the DNA. 

30 The PCR is expected to generate many different fragments, however the major 

products should be those primed by primer #2 and a random primer (see below). Only these 
products were of interest as they should encode the N-terminal half of the protein and they 
could be monitored since they would be radiolabeled. After separation of the PCR products 
by gel electrophoresis and exposure to film, a limited number of radiolabeled bands should 

35 be visible on the autoradiogram. These fragments can subsequently be cloned into a 
suitable vector, e.g., pUC19, and sequenced using routine methods. 
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Example 2. PCR of the spp 5' domain with random primers 

The 5' half of the spp gene was amplified in a two step PCR reaction as described 
herein. The PCR products were cleaned using a QIAquick PCR purification column and 
recovered in 50 ul water. The sample was concentrated to a final volume of 25 p.1 under 
5 vacuum in a SpeedVac® SCI 10 (Savant). 20ul were loaded onto a 1.6 mm, 8% 
polyacrylamide gel containing 7M urea and analyzed by gel electrophoresis. The gel was 
stained with ethidium bromide (1 ug/ml in H20) to visualize the DNA and photographed. 
The gel was then dried and exposed to film. 

There were several defined bands visible both on the ethidium bromide stained gel 
10 as well as on the autoradiogram. They ranged in size between 50 bp to over 2000 bp, some 
bands being more intense than others. The most prominent bands were approximately 50bp, 
150bp, 400bp and 800bp in size. 

Example 3- Shotgun cloning of the PCR fragments: Cloning and sequencing of the 
PCR fragments 

15 The PCR fragments were blunt end cloned into Smal cut pUC19. Ligations of 100 

ng vector DNA with 1.0 pi and 1.5 pi of PCR products were performed at 16°C overnight. 
E. coli XL1 blue cells (a-complementing) were transformed by electroporation and plated 
onto selective plates containing IPTG and x-gal (for blue/white screening). 
Transformation of these cells resulted in approximately 8.5% white colonies in the presence 

20 of PCR products, whereas only 4.6% white colonies could be found in the control ligation 
without insert. White colonies had lost the ability to form the {3-galactosidase holoenzyme 
as a result of a reading frame disruption in the lacZ' gene and were no longer able to cleave 
x-gal. The reading frame was either disrupted by an insert (in the presence of PCR 
products) or exonuclease activity of the restriction enzyme preparation, destroying the Sma 

25 I site of the vector by cleaving terminal nucleotides and resulting in frame shifts after 
ligation. 

1 8 white colonies were picked and grown up as 2 ml cultures to prepare plasmids. 
A sample of each plasmid was linearized with AccI restriction endonuclease (cuts in the 
multi cloning site) and run on a 1% TAE agarose gel alongside the original pUC19. 
30 The results showed that 7 plasmids released a short fragment upon restriction 

digest. As there is only one AccI site in pUC19, these recombinant plasmids must contain 
an insert with a restriction site for this enzyme. 3 plasmids appeared larger than pUC19 but 
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did not release a fragment. The other 8 plasmids were of the same size as pUC19 and did 
not appear to have an insert. 

The 10 plasmids containing an insert were sequenced both with the forward and 
reverse pUC/M 1 3 sequencing primers. 
5 5 of the 6 plasmids that released a fragment after AccI digest harboured an 213bp 

insert that started with the primer #2 sequence and the known 14bp followed by a sequence 
of 174 nucleotides that was common to all of the plasmids (Sequence 1. Table 3). In one of 
the 6 plasmids the insert (Sequence 2. Table 3) was slightly different: the first 137 
nucleotides were in common with Sequence 1 (dotted line), however the insert sequence 
10 extended beyond this and deviated from Sequence 1 (see Table 3): 

Table 3 

Aplifi cation Product Sequence 1, 2 and 3 
15 Sequence 1 [SEQ ID NO: 9] 

5'GTTTT CATTC CCTAC CCTAT CCTTA CTTAA TCTTT TTATT . .AAAAA 
GAATG . _C AGAT. . TTTCA . _G AGAA . TATAT . .AAAAA _ AGGTC _ ATTCT. _ GTAGC 
CAACA . GACAA. . TTTGT. TGTAT . ACACT . TGTAA _ TAATA . AAGAA _ ATAGA 
CC ATT . TTCGC__ TT AGG. ._TATTA_ _ GTGTT _ .TCTAA . .AAAAC TAGG T AATGC 
20 AGTGT TAA GA ACAAG ATTAA AAGAG CAATA CGT 3' 

Sequence 2 [SEQ ID NO: 10] 

5' ACGAT AAATA ACAGT ATGTT ATTGG AAAAA GTCTA CCGAA..TTAAA. .AAGAA 

TGCAG . ATTTT . . C AG AG . . .AATAT. . AT AAA. . AAAGG. . _TC ATT . . CTGT A . . GCC AA . . C AG AC . . _aATTT 
25 GTTGT . AT AC A . CTT_GT_ . _AATAA_ _ _TAAAG. . .AAATA. . _G AC C A. . . TTTTC . . GCTT A . . GGTAT . . .TAGTG 
T rp Tr rr AAAAA ACTAG GTAAT GCAGT GTTAA GAACA AGATT AAAAG AGCAA TACGT 3' 

Sequence 3 [SEQ ID NO: 11] 

5 ' . . . GAAAA TTTCA AAGTA CATAA GTCGC ATATA TTGGC CAAAG ATTGT AATAG 
30 CAAGA CAGCC AGCTA AAGAT ATGAC GACTT TACAA ATACA GAATA GTCTT GAGCA 
CGTAC TTAAA ATTGC CAAAG TTTTT AATAA AAAGA TTAAG TAA GG ATAGG GTAGG 
GAATG AAAAC 3' 

stop codon (underlined) 

35 Primer #2 sequence is shown in bold and also the known 14 bases underlined (3 

changes to known sequence in italic). 
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The predicted translation products of Sequence 1 and 2 showed significant 
homology to B. subtilis P-protein and indicated that these inserts contain the full sequence 
information for the 5' domain of the spp gene. 

In 2 plasmids the insert started with the sequence of primer #1 followed by 3' half 
5 of the spp gene (Sequence 3, Table 3). This sequence was already known but the new 
sequence showed 9 base changes (shown in bold) in comparison to the original sequence 
data obtained by random sequencing of the S. aureus library. 

2 plasmids contained inserts with different sequences that are a result of unspecific 
priming during PCR. 
1 0 Example 4 Analysis of the sequence data 

The newly obtained sequence data was drawn together and predicted to represent 
the complete sequence of the spp structural gene. The reading frame was determined from a 
partial sequence of the 3' domain. According to this reading frame the start codon ATG 
could be identified 174bp upstream of primer #2 sequence (see above). The total length of 
15 the gene is 354bp. 

The nucleotide sequence of the spp gene is shown in Table 1 [SEQ ID NO: 2]; the 
start codon (ATG) and stop codon (TAA) are both shown in bold. The deduced translation 
product of the spp gene (SP protein) is 1 17 amino acids is also shown in Table 1 [SEQ ID 
NO:2]. 

20 The translated sequence was aligned with known amino acid sequences of the 

RNase P protein from 5 different prokaryotes. The S. aureus SP protein sequence matched 
the B. subtilis P-protein sequence well, significant homology to the B. subtilis protein. 
Example 5. PCR of the spp gene 

The newly obtained sequence data was used to design PCR primers for the 
25 amplification of the full length spp structural gene. 

The PCR resulted in a single band of approximately 360bp with a total DNA yield 
of 2 u\g as determined by spectrophotometrical measurement. The DNA fragment was 
cleaned using a QIAquick PCR purification column and recovered in 80u.l water. 

30 Example 6. Cloning spp into pMa!c2 

The pMalc2 vector provides a method for expressing and purifying a protein 
produced from a cloned gene (New England BioLabs) (see also, Guan, C, Li, P., Riggs, 
P.D. and Inouye, H. (1987) Gene 67, 21-30; Maina, C.V. et al. (1988) Gene 74, 365-373: 
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Riggs, P. in Ausubel, F.M. et al. (eds) Current prot. in Molecular Biol. (1992): Kellerman 
and Ferenci (1982) Methods in Enzymol. 90, 459-463; Yanisch-Perron, C. et al (1985) 
Gene 33, 103-1 19; Zagursky, R.J. and Berman, M.L. (1984) Gene 27, 183-191, regarding 
the vector and its uses). 

5 The cloned gene is inserted downstream from the malE gene of E. coli, which 

encodes the maltose-binding-protein (MBP), resulting in the expression of an MBP fusion 
protein. The method uses the strong "tac" promoter under the control of the lac repressor 
and a one step purification of the fusion protein using MBP's affinity for maltose. Unique 
restriction sites are available between malE and lacZa for inserting the gene of interest, 

10 allowing blue/white screening of transformants. The vector also contains a recognition site 
for the specific Factor Xa protease located just 5' to the polylinker. This allows MBP to be 
cleaved after purification without leaving any vector derived residues attached to the 
protein of interest when the gene is cloned into the XmnI site. 
Example 7. Ligation of the spp PCR product into pMalc2 

15 1 of the protein fusion vector pMalc2 was cut with XmnI and BamHI restriction 

endonucleases to enable directional cloning of the spp structural gene. To remove the small 
15 bp insert the digest was cleaned using QIAquick PCR purification column and the DNA 
recovered in 90 u.1 water. A sample was analysed by agarose gel electrophoresis to ensure 
complete digestion of the vector. After 30 min of incubation at 37°C with both enzymes the 

20 vector was completely cut. 

The spp PCR product was digested with BamHI and cleaned using a QIAquick 
PCR purification column. 

Vector and insert were ligated overnight in a 20uJ reaction at 16°C. The molar ratio 
of vector to insert was 1:5. To prepare the DNA for electroporation, the ligation was 

25 desalted using QIAquick Nucleotide Removal columns. 

Example 8. Transformation of E. coli XL-1 Blue cells 

40 nl electrocompetent E. coli XL-1 Blue cells were transformed with 2 uJ desalted 
ligation reaction. Cells were recovered in 1ml SOC medium after electroporation. 50 jal of a 
1:100 dilution were plated onto an LB plate containing ampicillin, IPTG and x-gal and 

30 incubated at 37°C overnight. 1 1 white and 1 blue colony were recovered. The 1 1 white 
colonies were grown up and the plasmids were isolated from the clones. Sequencing of the 
plasmids showed that 6 of the 1 1 plasmids contained the correct spp insert. The resulting 
plasmid was denoted pMalc2::spp, and the fusion protein product MBP-SPP. 
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Example 9 Growth of cells and fusion protein induction in E. coli XL-1 Blue 
pMalc2::sp/7 

A 1 liter culture of E. coli XL-1 Blue pMalc2::.sp/> was grown at 37°C, 220 rpm and 
expression of the MBP-SPP fusion induced by adding ImM IPTG at an A 600 of 0.6. 
5 Induction was continued for 2 hours. The A^00 wa s monitored over that time period to 
ensure that cells were still growing during induction. 1ml samples were removed from the 
culture after 0.5, 1 .0 1 .5 and 2 hours, the cells pelleted and the pellet resuspended in 100 ul 
SDS gel loading buffer. 3 (il of the samples were analysed by SDS-PAGE with Coomassie 
staining. 

10 The cells were not effected by the overexpression of the fusion protein, they kept 

growing and after 2 hours of IPTG induction reached an A^00 Q f j 43 

Protein extracts from samples were separated by SDS-PAGE and transferred onto a 
nitro-cellulose membrane by Western transfer. The MBP-SPP fusion protein could be 
immuno-detected by incubation of the membrane with anti-MBP rabbit serum and HRP- 

15 coupled anti-rabbit antibodies followed by ECL detection. The Western Blot revealed that 2 
proteins were induced upon IPTG addition. The major protein was of the expected size of 
approximately 56kD, the second protein, produced in lower amounts, was 10-15 kD larger. 
Analysis of the spp sequence revealed an accumulation of AUA (ile) and AGA (arg) codons 
that are rare in E. coli. The normal frequency of AGA in E. coli is 0.27%, of AUA 0.51%. 

20 The spp sequence consists to 4.3% of AGA and 5.1% of AUA. These codons may cause 
frame shifts and other altered translation events when overexpressing a protein with a high 
abundance of rare codons. 

Another E. coli strain was available that constituively expresses the ArgU tRNA 
(tRNA ucu ) and IleX tRNA (tRNA UAU ) from the plasmid pRI952. 

25 Example 10. Transformation of E. coli W3110 pRI952 

50 ul electrocompetent E. coli W3110 pRI952 cells were transformed with 2 u.1 
desalted ligation reaction. Cells were recovered in 1ml SOC medium after electroporation. 
lOOul of the culture were plated onto selective LB plates (ampillin, cam) containing IPTG 
and x-gal and incubated at 37°C overnight. Too many colonies grew on the plate and could 

30 not be counted. 5 white colonies were selected and their plasmids isolated. Sequencing of 
the plasmids showed that all contained the correct spp insert. 
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Example 11. Growth of cells and fusion protein induction in E. coli W3110 pRI952 
pMalc2::s/<7? 

The cells were grown and the expression of the fusion protein was induced as 
described above. The cells kept growing during the induction period from an A 600 of 0.6 to 
5 1 .45. Samples were taken from the culture at 0.0, 0.5, 1 .5 and 2.5 hours after IPTG addition 
and protein extracts analysed by SDS-PAGE with Coomassie staining, as well as Western 
transfer with immunodection. Overexpression of the rare tRNAs UeX and ArgU resulted in 
only one MBP-SPP fusion protein being overexpressed during IPTG induction as detected 
by the anti-MBP serum. 
10 Example 12. Purification of the MBP-SPP fusion by affinity chromatography 

Following optimal induction, the cells (E. coli W3110 pPJ952 pMa\c2::spp) were 
harvested and disrupted by sonication after a freeze-thaw cycle to weaken the cell wall 
using known methods. The cell debris was removed by centrifugation and the protein 
concentration of the crude extract determined using the BIO-RAD Protein Assay. The total 
15 amount of protein in the crude extract was 86 mg. A sample of the crude extract was 
analysed by SDS-PAGE with Coomassie staining which showed that the fusion protein was 
released from the cells during sonication. 

The crude extract was then pumped through the amylose resin column, to which the 
MBP-SPP fusion bound, and unbound protein was removed from the column by extensive 
20 washing with column buffer. The fusion protein was then eluted with column buffer 
supplemented with lOmM maltose using methods described herein. The eluate was 
collected as 3 ml fractions. Samples were removed during the purification procedure and 
analyzed by SDS-PAGE with Coomassie staining. Almost all of the fusion protein bound to 
the amylose resin, only trace amounts were found in the flow-through and no fusion protein 
25 was washed off the column. Fractions 4-1 1 contained reasonable amounts of protein and 
were pooled, yielding in 25 ml protein solution with a concentration of 1 mg/ml. 
Example 13. Factor Xa cleavage of the fusion protein and separation of MBP and 
SPP 

500 fig Factor Xa protease were added to the solution and the fusion protein was 
30 cleaved overnight at 4°C. A sample was analysed by SDS-PAGE before and after Factor Xa 
incubation to monitor complete cleavage. After 15 hours almost all the fusion protein was 
cleaved resulting in the 42 kD MBP and SPP of approximately 14kD. 

The SP protein readily precipitated upon Factor Xa cleavage and could be separated 
from MBP by centrifugation. The SPP pellet was washed 3 times and then resuspended in 
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5ml column buffer. The MBP remained in solution and was found in the supernatant 
whereas the pellet consisted mainly of the SP protein and one minor contaminating protein, 
resulting in a reasonably pure preparation of SPP. The total SPP yield was 2.18 mg/liter 
culture. The SPP preparation was diluted with column buffer to a final concentration of 
5 0.32 mg/ml. ImM DTT was added to prevent intermolecular disulfide bond formation 
between cystein residues. The addition of 7M urea denatured and apparently completely 
resolubilized the protein. The protein was aliquoted, snap-frozen and stored at -80°C. 
Example 14. In vitro transcription of the SP RNA 

SP RNA was transcribed in vitro from the plasmid pSPR using T7 RNA 
10 polymerase. pSPR is a pUC19 derivative that harbours the S. aureus spr gene behind a T7 
promotor. The plasmid was linearized with BamHI restriction endonuclease to enable run 
off transcription. The Ambion MEGAscript™ T7 kit was used for large scale transcription 
of the SP RNA. The transcription reaction was performed at 30°C to allow slow folding of 
the RNA into the correct conformation during synthesis. The RNA was cleaned under non- 
15 denaturing conditions using CLONTECH Chroma-Spin 30 columns and the RNA recoverd 
in 40 (0.1 water. The quality of the transcript was monitored by gel electrophoresis on a 5% 
polyacrylamide TBE gel with 7M urea. The RNA was visualized by UV-shadowing over a 
TLC plate. 

In vitro transcription of SP RNA from pSPR resulted in a single RNA product of 
20 the expected size of 401 nucleotides. The total RNA yield was 140 |ig (determined 
spectrophotometrically). After cleaning, the RNA was recovered in 40 \x\ water resulting in 
a 29 uM solution, that was stored at -20°C. 

Example 15. In vitro transcription of E. coli ptRNA met incorporating [a- 32 P] UTP 

E. coli ptRNA met , a substrate for RNase P, was transcribed in vitro by T7 RNA 
25 polymerase (Epicenter) in the presence of [ct-32p] TjTP to internally label the RNA. The 
DNA template pGem3Z-ptRNA was linearized with BstNI restriction endonuclease to 
allow run-off transcription. The transcription was performed in a 20 |Lxl reaction at 37°C for 
60min. To remove unincorporated nucleotides the 93nt RNA (sequence shown below) was 
cleaned under non-denaturing conditions using a CLONTECH Chroma-Spin 10 column. To 
30 calculate the specific activity of the RNA probe, lul samples were removed before and 
after cleaning, mixed with 5 ml scintillation cocktail and counted in a scintillation counter. 

ptRNA met in pGem-3Z, sequence of the transcript (93mer) 5'-» 3' [SEQ ID NO: 

12]: 
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GGGCG AAUUC GCCUC GGCUA CGUAG CUCAG UUGGU UAGAG CACAU 
CACUC AUAAU GAUGG GGUCA CAGGU UCGAA UCCCG UCGUA GCCAC CAG 

Transcription of ptRNA™' resulted in a 2pM (6 ng/u.1) solution. 64.3% of the 
nucleotides were incorporated in the transcript and the specific activity was 2.171 10" 

5 cpm/pg. 

Example 16. 5' end-labeling of p6AT-l 

The chemically synthesized 42nt RNA molecule p6AT-l was 5' end-labeled with 
high specific activity [y-32p] ATP (6000pCi/pl) using bacteriophage T4 polynucleotide 
kinase. The labeling was performed in a 10 jil reaction at 37°C for 30 min. The enzyme was 

0 heat-inactivated at 95°C for 2 min. To remove salts, unincorporated nucleotides and the 
enzyme, p6AT-l was gel purified from a 20% polyacrylamide TBE gel with 7M urea. The 
RNA was extracted from the gel slice and precipitated with 300mM NaOAc and 2.5 
volumes 100% ethanol. After a spin at 14.000g and 4°C the RNA pellet was washed twice 
with chilled 70% ethanol. The pellet was air dried and subsequently resuspended in 50 pi 

5 water. 0.5 pi were spotted onto a filter paper disk and counted in a scintillation counter 
(Cherenkov analysis). 

The end-labeling resulted in a p6AT-l preparation with an activity of 640,000 

cpm/pl. 

Example 17. Determination of the optimal buffer conditions for SP RNA reactions 

,0 E. coli Ml RNA is able to cleave ptRNA nKI or the minimal substrate p6AT-l in 

lOOmM NH4C1, lOOmM TrisCl pH 7.5, lOOmM MgCl 2 . The conditions for cleavage of 
either substrate by S. aureus SP RNA were unknown and clearly deviated from those 
optimized for E. coli. A large number of different buffer conditions were investigated in 
order to determine optimal buffer conditions for either substrate cleavage by the S. aureus 

15 ribozyme. 

Cleavage reactions (20 pi) were all performed at 37°C for 30 min with lOOnM SP 
RNA and an additional 200nM SP protein in the holoenzyme reactions. Substrates 
(ptRNA met or p6AT-l) were added in trace amounts (single-turnover reactions). Cleavage 
was monitored by size resolution of the intact precursor and the cleaved leader sequence by 
\0 denaturing gel elctrophoresis and autoradiography or visualization on the phosphorimager 
(Molecular Dynamics Storm 860). 
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Example 18 Effect of KC1 versus NH4CI on p6AT-l cleavage 

All experiments were performed in lOOmM TrisCl pH7.5, lOOmM MgC12 and 1M 
or 2M monovalent salt (KC1 or NH4C1). 

There was hardly any cleavage product detectable at 1M KC1 or NH4C1. Cleavage occurred 
5 when 2M monovalent salt were used. Potassium chloride gave better results than 
ammonium chloride. 

Further optimization of the monovalent salt concentration was investigated 
following optimization of additional parameters described in the following sections. 
Example 19. Effect of the pH on p6AT-l cleavage 
10 Buffers containing lOOmM MgC12, 2M KC1 and lOOmM TrisCl at a pH of 7.0, 7.5 

and 8.0 were tested for the ability to promote p6AT-l cleavage by SP RNA. 
The substrate was cleaved in all three buffers but cleavage was very poor at pH 7.0 and 
optimal at pH 8.0. 

Example 20. Influence of PEG 8,000 on p6AT-l cleavage 
15 The addition of polyethylene glycol has been shown to improve substrate cleavage 

by E. coli Ml RNA. Here the effect of different PEG 8,000 concentrations on p6AT-l 
cleavage by SP RNA was demonstrated. 1%, 2.5% and 5% PEG were added to reactions 
containing 1M KC1, lOOmM TrisCl pH8. lOOmM MgC12. 

The ability to cleave the substrate improved with increasing PEG concentrations, and 
20 therefore 5% PEG was selected to be included in subsequent reactions. 

Example 21. MgCl2 requirements for p6AT-l processing 

The MgC12 concentration was successively reduced from lOOmM to lOmM to 

determine the lowest magnesium ion concentration at which the SP RNA could still process 

the substrate. lOOmM, 50mM, 25mM and lOmM MgC12 were tested in a buffer containing 
25 2M KC1, lOOmM Tris CI pH8 and 5% PEG. 

Cleavage results were obtained at lOOmM MgC12. However SP RNA was still able 

to process p6AT-l at 50mM and to some extend at 25mM MgC12. No cleavage could be 

observed at lOmM MgCI2. 

Example 22. Determination of KC1 requirements for p6AT-l cleavage 

30 The KC1 concentration was successively increased from 40mM to 1.5M in lOOmM 

TrisCl pH8, lOOmM MgC12 and 5% PEG. 

No cleavage occurred at KC1 concentrations below 400mM. At 400mM KC1 p6AT- 
1 processing was still very poor but increased with the salt concentration to reach a 
maximal level of 27% at 1.5M KC1. That was comparable to p6AT-l cleavage by E. coli 
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Ml RNA, where 35% of the substrate was cleaved after 20min. The best buffer conditions 
for p6AT-l cleage by SP RNA were 1.5M KC1, lOOmM TrisCl pH8. lOOmM MgC12, 5% 
PEG. 

Example 23. KC1 requirements for ptRNA™' cleavage by SP RNA 
5 ptRNA" 101 was cleaved under the same buffer conditions as described above. 

ptRNA™' was a better substrate than p6AT-l. By contrast to p6AT-l, ptRNAmet could be 
processed at KC1 concentrations below 400mM. Some cleavage could already be detected 
at 20mM KC1, whereas no cleavage of p6AT-I was observed under these conditions. The 
cleavage rate increased with the KC1 concentration. At 150mM KC1 more than 50% of the 

10 substrate was processed, reaching 82% at 800mM KC1. At concentrations of 1M or above 
85% of the ptRNA were cleaved, and no further increase could be detected, indicating that 
15% of the substrate was uncleavable. E. coli Ml RNA as a positive control cleaved 80% 
of the substrate in lOOmM Tris.Cl pH7.5, lOOmM NH4C1, lOOmM MgC12 within 20min. 
The best buffer conditions for pfRNA™ et cleavage by SP RNA were 1M KC1, lOOmM 

1 5 Tris CI pH8, 1 OOmM MgC12, 5% PEG. 

Example 24. Determination of the optimal buffer conditions for S. aureus RNase P 
holoenzyme reactions: p6AT-l cleavage by the holoenzyme 

The S. aureus RNase P holoenzyme cleavage of p6AT-l was investigated over a 
range of different KC1 concentrations at low magnesium ion concentrations. All reactions 

20 were performed in lOOmM TrisCl pH8, lOmM MgC12, 5% PEG and 40mM to 1.5M KC1. 
Cleavage could already be observed at 40mM KC1. The cleavage rate increased with the 
concentration of monovalent salt, peaking at 150mM, where 50% of the substrate was 
processed. At higher KC1 concentrations the cleavage rate decreased again and p6AT-l 
processing did not occur at KC1 concentrations of 600mM and above. The E. coli 

25 holoenzyme control cleaved 57% of the p6AT-l within 20min in lOOmM Tris.Cl pH7.5, 
lOOmM NH4C1, lOmM MgC12. The optimal buffer for p6AT-l cleavage by the 
holoenzyme was 150mM KC1, lOOmM Tris.Cl pH 8.0, lOmM MgCl 2 , 5% PEG 8000. 
p6AT-l was a better substrate for the holoenzyme than for SP RNA, as under optimal 
conditions only 27% p6AT-l was cleaved as opposed to 50% by the 5. aureus RNase P 

30 holoenzyme. 

Example 25. MgCl2 requirements for ptRNA cleavage by the holoenzyme 

The S. aureus RNase P holoenzyme failed to cleave ptRNA met under the same 
conditions employed for p6AT-l processing. The holoenzyme was not able to cleave the 
substrate at lOmM MgCl 2 under any given KC1 concentration. The concentration of 
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magnesium ions was increased to 20mM and ptRNA cleavage by the holoenzyme as well as 
the SP RNA ribozyme were tested at different KC1 concentrations. The buffers contained 
lOOmM TrisCl pH8, 20mM MgCl 2 , 5% PEG and lOmM, 50mM, 150mM, 600mM or 1.5M 
KC1. 

5 Cleavage of ptRNA by the holoenzyme at lOmM MgCb and low KC1 

concentrations was very poor. At 20mM MgC12 and 150mM KC1 the holoenzyme 
processed ptRNA very well, 54% of the substrate was cleaved within a 20min incubation. 
Only a low percentage of substrate processing could be detected at 600mM KC1, but good 
cleavage occurred at 1.5M KCI. The SP RNA alone was also able to cleave ptRNA under 

0 these conditions but was unable to cleave ptRNA at lower salt concentrations. Therefore the 
optimal MgCl2 concentration for ptRNA met cleavage by the holoenzyme was 20mM. 
Example 26 Determination of the optimal KCI concentration 

The S. aureus RNase P holoenzyme was tested for its ability to cleave ptRNA"" 3 
under different KCI concentrations ranging from 20mM to 1 .5M. The buffers contained 

5 lOOmM TrisCl pH8, 20mM MgCl 2 , 5% PEG 8000 and xM KCI. 

The holoenzyme was able to cleave ptRNA met at KCI concentrations as low as 
20mM. The cleavage rate increased with the salt concentration with a peak at 150mM 
where 55% of the substrate was processed. The cleavage rate decreased at KCI 
concentrations above 200mM but improved again at 1M KCI reaching a maximum at 1.5M. 

,0 The cleavage at high salt concentrations was not due to holoenzyme activity but result of 
SP RNA ribozyme activity at high KCI concentrations. 

ptRNAmet was a better substrate for SP RNA than for the S. aureus RNase P 
holoenzyme. The optimal buffer conditions for ptRNAmet cleavage by the holoenzyme 
were 150mM KCl.lOOmM Tris Cl pH8, 20mM MgCl 2 , 5% PEG. 

'.5 Example 27 Strain selection, Library Production and Sequencing 

The polynucleotide having the DNA sequence given in SEQ ID NO: 1 was obtained 
from a library of clones of chromosomal DNA of Staphylococcus aureus in E. coli. The 
sequencing data from two or more clones containing overlapping Staphylococcus aureus 
DNAs was used to construct the contiguous DNA sequence in SEQ ID NO:l. Libraries 

50 may be prepared by routine methods, for example: 
Methods 1 and 2 below. 

Total cellular DNA is isolated from Staphylococcus aureus WCUH 29 according to 
standard procedures and size-fractionated by either of two methods. 
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Method 1 

Total cellular DNA is mechanically sheared by passage through a needle in order to 
size-fractionate according to standard procedures. DNA fragments of up to 1 Ikbp in size 
are rendered blunt by treatment with exonuclease and DNA polymerase, and EcoRI linkers 
5 added. Fragments are ligated into the vector Lambda ZapII that has been cut with EcoRI, 
the library packaged by standard procedures and E.coli infected with the packaged library. 
The library is amplified by standard procedures. 
Method 2 

Total cellular DNA is partially hydrolyzed with a one or a combination of 

10 restriction enzymes appropriate to generate a series of fragments for cloning into library 
vectors (e.g., Rsal, Pall, Alul, Bshl235I), and such fragments are size-fractionated 
according to standard procedures. EcoRI linkers are ligated to the DNA and the fragments 
then ligated into the vector Lambda ZapII that have been cut with EcoRI, the library 
packaged by standard procedures, and E.coli infected with the packaged library. The library 

15 Example 28 Structure Probing 

S. aureus RNase P RNA was in vitro transcribed by T7 RNA polymerase and its 
structure was probed with a variety of enzymatic and chemical probes. Probing data 
indicates that the S. aureus RNase P RNA adopts a similar, but unique secondary structure 
to that of B. subtilis RNase P RNA. Filter binding and gel-shift assays showed that S. 

20 aureus RNase P protein binds to the RNA specifically. The binding constant of the 
complex was measured to be 20 nM. The S.aureus holoenzyme, as well as the RNA 
component alone, have been shown to be functional and the holoenzyme demonstrated a K m 
of 53 ± 4 nM and a k a of 3.4 + 0.1 min 1 with cloned 5. aureus pre-tRNA ptK as substrate 
under the experimental conditions. (I) Frank, D. N. an Pace, N. R. Ann. Review Biochem. 

25 (1998), 67, p. 153-80. (2) Massire, C. Jaeger L., and Westhof, E. J. Mol. Biol. (1998), 279, 
p. 773-93. 

A partial homology to the B. subtilis RNase P RNA sequence was identified from a 
randomly sequenced S. aureus genomic library. The complete structural gene encoding the 
RNase P RNA was amplified and cloned by a 3' primer of known sequence and a 5' 
30 degenerate primer designed to form the conserved PI helix. The 5 '-end sequence of the 
RNA was confirmed by primer extension using endogenous S. aureus RNA. S. aureus 
protein gene, spp, was identified in a similar manner. The gene was amplified, cloned, and 
expressed in E. coli as a maltose binding fusion. The MBP tag was cleaved off by Factor 
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Xa and the native protein was purified by column chromatography. The RNA secondary 
structure and potential tertiary interactions were studied by enzymatic and chemical 
probing. The enzymatic probes included RNase Tl, U2, VI , and T2. Chemical probes 
included DEPC. The RNase P RNA/protein interactions were investigated by gel mobility 
5 shift and filter binding experiments. A cloned pre-tRNA phl ' was used as a substrate for 
kinetic characterizations for both the holoenzyme and RNA only reactions. The 
endogenous RNA used in primer extension experiments was isolated from 5. aureus cells 
by TRIZOL method. 

S. aureus RNase P gene sequence 

10 The entire gene sequence (398nts) was recovered by a 3' primer of known sequence 

and a 5' degenerate primer designed to form the PI helix. The 5'-end sequence was 
confirmed by primer extension usingendogenous S. aureus WCUH29 RNA. The 5' most G 
could not be distinguished by primer extension experiments, however, this nucleotide (nt) is 
contained in the clone we used to transcribe RNase P RNA which has been shown to be 

15 functional. 

Michaelis Menten Plot of RNase P holoenzyme kinetics was determined under the 
following conditions: lx buffer: lOOmM Tris-Cl, pH7.0, 150 mM KC1, 10 mM MgC12, 5% 
PEG, [E] = 20 nM. Results from at least two independent experiments for holoenzyme: 
K m =53±4 nM and k cal =3.4±0.1min '.RNase P RNA-only kinetics in the following buffer: 50 
20 mM Tris, pH8.0, 1 M NH 4 CI, 200 mM MgCl 2 , and 5% PEG . Pre-tRNA^ is efficiently 
processed by the ribozyme. In both experiments, a cloned pre-tRNA ph "was used as 
substrate. 

Binding isotherm of RNase P RNA obtained by filter binding assay was assessed. 
The 3 '-end labeled RNA was mixed with the protein in the binding buffer containing 20 

25 mM K-Hepes, pH8.0, 0.01% NP-40, 400 mM NH 4 OAc, 1 0 mM MgCl,, and 5% glycererol. 
The K, value was generated by fitting the data with Grafit 4 one-site ligand binding 
equation: % binding = plateau/(l + K d /[S. aureus protein]). K, = 8 ± 1 nM from at least 
three independent experiments. 

Structural probing of 5'-end labeled S. aureus RNase P RNA reveled probing data 

30 demonsrating that RNase VI cleavage occurred at 8-11; 16-23; 30-33; 37-54; 64-84; 96- 
105; 119-130; 150-156; 166-169; 381-184; RNase T2 cleavage: 24-27; 55-60; 86-91; 106- 
108; 135-138; 170-177 (but was weak) Nucleotides (also "Nts" or "nts") 139-149 
(P10.1a(3') and the internal loop) and PI 2(3') are not accessible to both enzymes, 
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indicating that these nts may be buried in the structure. Nucleotides 166-169 (Jl 1/12) were 
sensitive to RNase V 1 cleavage, indicative of helical nature of the sctructure. 

Structural probing of 5'-end labeled S. aureus RNase P RNA using DEPC revealed 
cleavage data that agrees well with enzymatic data providede herein. Strong cleavage was 
5 observed mainly in loops, e.g. nts 159-161. A21 1-213.A203, 204, and A169 only showed 
mild reactivity indicating possible tertiary interactions. These data suggest that A 130 may 
basepair with U141, and U129 basepairs with A 142, with A131 as one nt bulge. 

Structural probing of 5' -end labeled S. aureus RNase P RNA revealed strong 
cleavage observed mainly in loops, e.g. L5.1 and L8. Moderate cleavage of A69 suggested 
10 that J5.1/7 is single-stranded. A106 and A107 only exhibited mild cleavage indicating a 
possible tertiary interaction. The strong cleavage of A59 does not support a tertiary 
interaction of this nt which is proposed for this nt with nt in L15.1 in B. subtilis structure. 

Structural probing of 3 '-end labeled RNA using DEPC showed strong DEPC 
cleavage at 292-294 (LI 5.2), and 271 (bulged A). No obvious cleavage in A266 and 
15 A267(L15.1) was observed indicative of tertiary interactions. Nts 281-286 (P15.2(5')) 
showed RNase VI activity, whereas nts 290-294 (LI 5.2) were sensitive to RNase T2. 

Structural probing of 3'-end labeled S. aureus RNase P RNA using nucleases VI 
and T2 revealed data that strongly indicates the existence of helix PI 5.2 and unstructured 
L15.2. DEPC data is consistent with VI and T2. DEPC Strong cleavage occurred in loop 
20 L15.1, L15.2 and a bulged A271. Mild cleavage was seen in part of J 15.2/2. Minor 
cleavage of A266 and A267 indicates that these two nts may be involved in tertiary 
interactions. A332 in LI 9 is likely to be paired with U349 as both A332 and C331 were 
sensitive to RNase VI . 

Structural probing results demonstaretd that the conserved regions of S. aureus 
25 RNase P RNA adopt similar, though unique structural properties to its E. coli and B. 
subtilis counterparts. Moreover, the conserved regions PI, P4, P12, P15, P15.1, and P15.2 
have been confirmed by RNase VI cleavage. Strong DEPC cleavage was observed in L8, 
J10.1/11, L15, L15.1, and L15.2, which further supports the structural model. Strong 
DEPC cleavage of LI 5.2 was observed indicating this region is not structured. VI cleavage 
30 data supports evidence for P4 long range interaction. Evidence supports possible internal 
loop interaction in L19. P12(3') and J12/11 are mainly not accessible to the enzymes but 
mildly accessible to smaller chemical probes such as DEPC. This phenomenon was further 
observed by primer extension experiments, indicating this region is most likely involved in 
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tertiary interactions. Part of Jl 1/12 was sensitive to RNase VI cleavage indicative of 
possible helical nature. 

Example 29 The structure of Ribonuclease P protein from S.aureus reveals a 
unique binding site for single-stranded RNA molecules 
5 Background 

Provided herein is a three-dimensional high resolution structure of the RNaseP 
protein from S.aureus (117 amino acids) by nuclear magnetic resonance (NMR) 
spectroscopy in solution. The protein has an ccfi-fold, similar to the ribonucleoprotein 
domain. Flexible, single-stranded nucleic acid-molecules were used as probes, to 
10 characterise the RNA binding propensity of the protein surface. 

RNaseP protein employs two different modes of RNA interactions, which occupy 
distinct areas: The NMR results identify a contiguous interaction site for flexible, single- 
stranded RNA molecules, representative of the 5'-leader sequence. Binding to this site is 
dominated by electrostatic charge and hydrogen bonding interactions, including main chain 

15 hydrogen bonds at the edge of the p-sheet. Cooperative aromatic base stacking interactions 
play a supportive role. The arginine-rich motif of RNaseP protein does not bind to single- 
stranded RNA and is therefore likely to be responsible for the binding to the highly 
structured P RNA component. Given the essentiality of RNaseP for the viability of the 
organism, knowledge of the S.aureus protein structure and insight into its interaction with 

20 RNA will help us to develop RNaseP and RNaseP protein as targets for novel antibiotics 
against this important pathogen. 

Abbreviations for Example 29 

NMR nuclear magnetic resonance 

NOE nuclear Overhauser effect spectroscopy 

25 RMSD root mean square deviation 

RNasePribonuclease P 

TOCSY total correlation spectroscopy 
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A. Biophysical characterisation 

DNA or RNA binding proteins are often thermodynamically unstable or even 
completely unfolded in the absence of their nucleic acid binding partners. This complicates 
5 the search for buffer conditions under which the protein is both stable and monodisperse in 
the concentration range required for NMR spectroscopy (0.5-2 mM). Literature data on 
RNaseP protein homologues from different species indicate a particularly wide range of 
thermodynamic stability: The B.subtilis protein forms a stable fold (T m = 66 °C; 
Nirajanakumari et al., 1998). In contrast, the E.coli protein is very iinstable and aggregates 
10 rapidly under physiological conditions, even though it may be stabilised to a limited extent 
at extremely high ionic strength ([urea]i/ 2 = 1.7M in 3.6M potassium chloride; Gopalan et 
al., 1997). 

Preliminary tests indicated that the S.aureus RNaseP protein may fall between these 
extremes: The protein is apparently unstructured at pH 4.8 but becomes more structured 

15 with increasing pH. A more careful thermodynamic analysis by urea-induced denaturation 
(Fig. 1) illustrates that the protein unfolds easily ([urea]i/ 2 = 1M) at physiological pH and 
low salt concentration. The transition is fully reversible and a fit to a two-state model (Pace 
et al., 1989) provides an estimate for AG = 10 kJ/mol. Similar to E.coli, thermodynamic 
stability was found to increase strongly at higher ionic strength. Titration with sodium 

20 chloride at the midpoint of the unfolding curve, i.e. in the presence of a constant urea 
concentration of 1 M, leads to an almost complete recovery of the native fold at a salt 
concentration of 250 mM (Fig. 1). 

Under the conditions finally chosen for NMR experiments (pH 6.0, 500 mM sodium 
chloride, 25 °C) NMR samples with protein concentrations of up to 1.5 mM and good long 

25 term sample integrity (> 6 months) were obtained. The determination of the self-diffusion 
constant under these conditions by NMR spectroscopy (Altieri et al., 1995) indicates an 
apparent molecular weight of MW app =12 kD (D = 1 .135xl0"6 cm 2 /s). This is slightly below 
the calculated molecular weight of 13.7 kD (data not shown), suggesting that, despite its 
marginal thermodynamic stability, the protein is perfectly monodisperse under NMR- 

30 sample conditions. As illustrated in Fig. 2a, NMR spectra of RNaseP protein recorded 
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under these conditions were of excellent quality, with good chemical shift dispersion, 
resolution and strong NOE-relays. 

B. NMR-spectroscopy and structure calculation 

Standard triple resonance through-bond methodology was employed to obtain sequence 
5 specific resonance assignments, using a 1 5 N, 1 3 C doubly labelled sample of RNaseP 
protein: 3D-HNCA, 3D-CACONH and 3D-CBCA(CO)NH yielded essentially complete 
assignments of amide *H, I5 N, C a and cP resonances. Fig. 2b exemplifies the assignment 
procedure for residues 13-18 and illustrates the quality of spectra used. Starting from these 
spin systems, assignments were then extended to the attached protons and other side chain 
0 1 H and 1 3 C spin systems with 3D-HCC(CO)NH and 3D-HCCH-TOCSY experiments. All 
aliphatic side chains spin systems were assigned at this stage, except the outer positions of 
lysine and arginine which are severely overlapped in this highly basic protein (19 lysine 
and 8 arginine residues). Peripheral Nrb-groups and aromatic side chains were assigned 
during the course of the structure calculations on the basis of their NOE-connectivities to 
5 the cPh or C7H protons. Intra-aromatic assignments were obtained by the TROSY-type 
HCCH-COSY method (Pervushin et al., 1998). Assignments for residues Metl, Asn 35 and 
Asn 36 are considered tentative, because these spin systems could not be connected to the 
rest of the molecule by either through-bond or NOE-relayed connectivities. Chemical shift 
data have been deposited in the BioMagRes data base (http://www.bmrb.wisc.edu). 

,0 Intramolecular NOE upper distance constraints were obtained primarily from two sets 

of three-dimensional 15 N and 13 C correlated j^H^Hj-NOESY spectra with mixing times 
of 60 and 150 ms each. An additional set of 13 C correlated [ 1 H, 1 HJ-NOES Y spectra was 
recorded with the 13 C offset centred in the aromatic region of the spectrum. For the 
structure calculations, upper limit constraints derived from the 60 ms and 120 ms were used 

15 simultaneously, but a significantly tighter calibration function could be applied to data sets 
with the shorter mixing time. Structure calculations were performed using the torsion angle 
molecular dynamics algorithm implemented in the program DYANA, starting from random 
conformers (Guntert et al., 1997). Constraints that were considered ambiguous due to 
chemical shift degeneracy and, simultaneously, spatial proximity of the corresponding atom 

\0 pairs in preliminary structures were systematically analysed with the NOAH option of the 
DYANA program and discarded whenever several different assignment possibilities were 
consistent with the calculated structure. In addition, angle constraints were calculated 
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from '3c a and ' 3 cP secondary chemical shifts. After several iterative cycles of spectral 
analysis and structure calculations a total of 1598 restrictive distance constraints were 
obtained and used as input for the final calculation. 

Table 4 and Figure 3a-c) provide an overview of the statistical parameters for the final 
5 structure calculation. A total of 100 conformers were calculated, of which the 20 

conformers with the smallest residual violations (i.e. the lowest DYANA target function) 
were selected to represent the solution structure of RNaseP protein. Although RMSD 
calculations including side chain non-hydrogen atoms are somewhat biased by the large 
number of poorly ordered long side chain residues in RNaseP protein, the majority of the 
0 polypeptide backbone, including the regular secondary structures, is well defined with an 
RMSD of 0.62 A for the constrained regions of the peptide (Fig. 3b). The N- and C- 
terminal residues and the long loops comprising residues 36-41 and 50-55, which are 
poorly connected by long range NOE constraints with the bulk of the molecule (Fig. 3a), 
show increased disorder (Fig 3b, 4b). Consistently lower heteronuclear *H — NOE 
5 intensities suggest that these peptide segments are relatively mobile and poor structural 
definition is not due to spectral overlap (Fig. 3c). 

C. The solution structure of S.aureus RNaseP protein 

RNaseP protein (Fig. 4) has an afi-fold, a common architecture in RNA-binding 
domains (Varani, 1997). The central four-stranded P-sheet ($y. residues 23-28; P2'- 31-34; 

10 P3: 82-86; P4: 44-49) is arranged with a sheet topology of pi -P2-P4-P3 (from right to left 
in Fig. 4b). The P-sheet is surrounded by three a-helices (a] : 12-21 ; OL2'. 56 -75; 0:3: 95- 
109) and several 3]Q-helical turns (residues 7-10 and 88-92, not shown in the schematic 
drawing of Fig. 4b). One face of the p-sheet forms the hydrophobic core of the protein by 
packing together with helices 0:3 and 0:3. The other face forms a characteristic groove 

15 together with helix cq , which is oriented perpendicularly across the P-sheet. This groove 

has been identified by chemical crosslinking as the primary interaction site for the 5'-leader 
sequence (Nirajanakumari et al., 1998; in the following: 'substrate binding site'). The 
topology of the S.aureus RNaseP protein is similar to the recent crystal structure of the 
B.subtilis protein, which has 48% sequence identity (Stams et al., 1998). The general 

\0 structural features of RNaseP protein, i.e. a four-stranded P-sheet, which on one face forms 
a hydrophobic core, while the other face presents a binding site for RNA ligands, are 
common among RNA-binding ocP-proteins (Varani, 1997) and in particular the 
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ribonucleoprotein family (Nagai et al., 1990). A major difference between RNaseP protein 
and the RNP fold is the parallel orientation of strands P3 and P4. which occasions an 
unusual lefthanded crossover connection in RNaseP protein. 

The multiple sequence alignment in Fig. 5 gives an overview of phylogenetically 
5 conserved residues, which are most likely to play an important role either in maintaining 
the structural integrity of the fold or in ligand interactions. Within a family of sequences 
from the SwissProt database, including examples from both gram-positive and gram- 
negative bacteria, conserved residues display a characteristic distribution between the 
hydrophobic core and the protein surface. Of the 20 residues (indicated by * in Fig. 5) that 

10 are buried in the hydrophobic core between the p-sheet and helices CI2 ana " a 3' on ^ tnree 
(Ile62, Ue77 and Vail 05) are conserved throughout the family. The much smaller interface 
between helix a\ and the P-sheet has 4 residues (indicated by ■&), two hydrophobics (Ile9, 
He 18) and two charged residues (Arg44 and Asp81), all of which are conserved. Arg44 and 
Asp81 are in close vicinity and very likely to form a salt bridge, although this cannot be 

15 observed directly by NMR. The Arg49->His mutant in E.coli (corresponding to Arg44 in 
S.aureus), is known to confer temperature sensitivity on the reconstituted holoenzyme, 
suggesting that this electrostatic contact has an important structural role (Baer et al., 1989). 
The general scarcity of conserved residues in the hydrophobic core may explain the huge 
variations in thermodynamic stability of RNaseP proteins. 

20 The majority of conserved residues are solvent exposed, suggesting that RNaseP 

protein has evolved predominantly in response to requirements for intermolecular 
interactions with its RNA ligands. Conserved residues are not dispersed over the entire 
protein surface, but cluster in a narrowly defined area of the three-dimensional structure 
(Fig. 4b), including the basic residues of the RNR-box at the N-terminus of helix <X2 

25 (residues 59-67) and several aromatic or hydrophobic residues within the substrate binding 
site. Site-directed mutagenesis studies on the E.coli protein have implicated some of these 
residues in enzymatic activity and substrate specificity (Gopalan et al., 1997), in particular 
Phel5 and Tyrl9 on helix cci as well as several residues in the RNR-box. In general, 
positive surface charges, which are very likely to be involved in ligand interactions, are also 

30 very unevenly distributed (Fig. 4c): The RNR-box comprises a very dense cluster of five 
charges, but the substrate binding site consists predominantly of hydrophobic and 
uncharged hydrophilic residues. Given the large number of positive surface charges in 
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RNaseP protein ( 1 9 Lys and 7 Arg. not including Arg44), it is remarkable that outside the 
RNR-box only very few, namely Lys51 and 52 on the boundary of the substrate binding 
site, are conserved throughout the family. Phylogenetic conservation and the high 
proportion of arginine suggest that the RNR-box is involved in very specific RNA 
5 interactions, whereas the other basic surface residues are restricted to a more generalized 
role in providing a pool of counter ions. 

D. Localisation of the RNA binding site 

An important question in exploring potential substrate interactions of RNaseP protein 
from a structural point of view is to differentiate the effects of non-specific electrostatic 

0 interactions with the phosphate backbone from more specific interactions that recognise 
base or sugar-related features of RNA. In the absence of three-dimensional structural data 
on the RNaseP holoenzyme, isolated RNaseP protein affords a simple low molecular 
weight model to investigate with residue-specific resolution RNA-protein interactions with 
small flexible substrate analogues. In the model system of the invention, localisation of the 

5 RNA binding site by NMR-spectroscopy is infered from backbone ] HN and 15 N chemical 
shift deviations in the [ I H^ 5 N]-HSQC spectrum of RNaseP protein, that are induced by 
titration with small, single-stranded oligonucleotides. Although this simple model cannot 
aim to replicate the precise 3-dimensional structure of the holoenzyme, it is believed that 
low affinity binding of oligonucleotides to isolated RNaseP protein is governed by the same 

'0 generic binding propensities of protein and RNA as the interaction between the holoenzyme 
and the flexible leader sequence and that it will involve the same structural features. 
Experimentally, this means that the choice of suitable model oligomers is limited by the 
following two considerations: 

(1) Little is known about the sequence specificity of the holoenzyme for nucleotides -4 
15 to -8 of the leader sequence, but 5'-leader sequences of RNaseP substrates are in general 
highly variable in sequence as well as in nucleotide composition. AU-rich leader sequences 
are favoured in B.subtilis (e.g. B.subtilis: Wawrousek et al., 1984; Green et al., 1985; 
Garrity & Zahler, 1993) and are cleaved preferentially compared to GC-rich sequences, 
however it is not clear to what extent this effect depends on the protein subunit (Loria and 
50 Pan, 1998). 
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(2) Steric guidance of the substrate by RNA-P RNA interactions downstream from the 
cleavage site, involving the T-stem loop and the 3-CCA trinucleotide, helps to ensure that 
the 5 -leader sequence binds to the protein subunit not only in proper register, i.e. at 
nucleotides -4 to -8, but also in the correct direction (Nirajankumari et al., 1998). It is 
5 known from crosslinking experiments that in the holoenzyme the 5'->3' vector is oriented 
vertically across the fj-sheet from strand (5 j to P3, but it is not clear, given the present lack 
of knowledge about sequence specificity and affinity, whether this orientation can be 
replicated in the absence of P RNA. 

These considerations suggest that mononucleotides and small homo-oligomers may be 
0 the most appropriate choice as model substrates for NMR investigations, because it does 
not presuppose any a priori knowledge about sequence specificity and because binding will 
be independent of the register or the chain direction relative to the binding site. Whereas 
mononucleotides are sufficient to probe the general binding propensities on the surface of 
RNaseP protein, homo-oligomers are used to potentiate cooperative binding phenomena 
5 that may require multivalent interactions over extended areas of the protein surface. 

A first series of titrations was performed with mononucleotides, which represent the 
smallest possible RNA fragments (Fig. 6a). Titrations were performed with the four 
different ribonucleotides as well as 2'-deoxy-AMP. The nucleosides of cytidine and uridine 
were also used to distinguish the relative significance of electrostatic charges (In all cases 

10 the data in Fig. 6a are for the highest concentration of 40 mM only; guanosine and 
adenosine nucleosides were not tested because of poor solubility). Because RNA 
interactions with this highly basic protein (calculated pi = 10.5) were expected to have a 
significant electrostatic component, the salt concentration was reduced to 100 mM sodium 
chloride and the pH raised to 7.0 to stabilise the protein under these conditions. No 

>5 saturation was achieved at ligand concentrations up to 40 mM and only one set of spectral 
signals was observed in all experiments, indicating that ligand binding is very weak and 
kinetics are in the fast exchange regime. Chemical shift changes are generally small (i.e. < 
0.2 ppm for ^H), indicating that the three-dimensional structure of RNaseP protein is 
minimally affected by ligand binding. Chemical shift differences for N-and C-terminal 

$0 residues are not considered in the following discussion, because they may reflect a general 
stabilisation of these rather flexible peptide segments rather than a nucleotide-specific 
interaction. 
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Most residues of RNaseP protein experience some concentration-dependent chemical 
shift changes in the presence of mononucleotide ligands, but the titration data nonetheless 
point to several preferential interaction sites: Four distinct regions along the polypeptide 
sequence experience either larger than average chemical shift differences or, alternatively, 
5 discriminate between the different nucleotides in a charge, base or sugar-specific manner. 
Broadly, these include the following regions: (A) helix aj and ^-strands Pi and P2 
(residues 10-34); (B) the loop between p-strand P3 and helix 0*2 (residues 47-54); (C) p- 
strand p 4 and the following loop (residues 82-90); (D) the C-terminal segment of helix 013 
(residues 101-109). A comparison of the binding profiles of the different ligands in Fig. 6a 

10 reveals a second element of specificity, depending on the chemical features of the ligands. 
Although a detailed deconvolution of these ligand-dependent chemical shift differences in 
terms of relative contributions from electrostatic, hydrogen bonding or hydrophobic 
interactions is bound to be difficult because several different functionalities of the 
nucleotide may be involved, some qualitative observations towards an initial 

15 characterisation of structure-activity relationships in the binding pocket may be of interest 
here: 

— Interaction region A has a markedly stronger response for AMP compared to 2-deoxy- 
AMP and the other ligands, in particular at residues Ser24 and Val25 on the edge of the P- 
sheet, suggesting that the 2'-hydroxyl group and the adenine base interact cooperatively 

20 with the protein. 

— Region B interacts differentially with nucleotides and does not respond to uncharged 
nucleosides. This suggests a role for the two conserved lysine residues in this region, 51 
and 52, in very specific electrostatic interactions with the ligand (cmp. alignment in Fig. 5). 

— In region D all ligands, nucleotides and nucleosides, have very similar chemical shift 
25 profiles, indicating a binding mode independent of electrostatic charge and a preference for 

the base moiety of the ligand. 

A second series of titrations was performed with homo-oligomeric nucleic acids. Figure 
6b summarises the results with three different ligands, Air> Cjq and (2'-deoxy-A)io at two 
different ionic strengths (100 and 300 mM sodium chloride). Generally, homo-oligomers 
30 are found to bind in the same regions of the protein as mononucleotides, but about 40-100 
times more strongly. While it seems likely that much of this increase in affinity is 
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attributable to the cooperative effects of multiple electrostatic charges, there are some 
interesting site-specific effects of homo-oligomer binding that are not observed for 
mononucleotides: An increase in selectivity is observed for residues 15-20 which are 
affected by homo-oligomers but not by mononucleotides. This includes residues Phel5 and 
5 Tyrl9, which are essential for enzymatic activity and have been implicated as candidates 
for aromatic base-stacking interactions (Gopalan et al., 1997; Stams et al., 1998). Another 
characteristic feature of homo-oligomer binding is the much better Iigand discrimination. 
On titration of RNaseP protein with A\q in 100 mM sodium chloride it was observed that 
instantaneous coprecipitation due to charge neutralisation occurred. Even at 300 mM 

10 sodium chloride, the lowest salt concentration at which the interaction with Ajq could be 
measured, the NMR linewidth is significantly broadened, an indication for the presence of 
oligomeric aggregates. In contrast, (2-deoxy-A) ]0 and Cjq have only a moderate effect at 
100 mM and interact very weakly at 300 mM salt (Fig. 6b). This suggests that the RNaseP 
protein substrate binding site has a high sugar and base-related selectivities of RNA over 

15 DNA and for A over C, respectively. It is believed to be the first experimental observation 
of a significant base preference in RNA recognition by RNaseP protein. These initial results 
open a perspective for the develpoment of more specific ligands as RNaseP inhibitors, 
based on established nucleotide chemistry. Also, it will be interesting to investigate 
whether the base or sequence preference of RNaseP protein is correlated with sequence 

20 specific recognition of the leader sequence by the RNaseP holoenzyme. 

When mapped onto the three-dimensional structure of RNaseP protein, peptide 
segments A-D form a contiguous surface area (Fig. 7) along the direction of the groove 
formed between helix cq and the (3-sheet and extending to the C-terminal part of helix (X3. 
This area measures approx. 18-20 A, which is sufficient to accommodate, in an extended 

25 conformation, those 4-6 nucleotides of the leader sequence that are postulated to interact 
directly with RNaseP protein. The 6 positive charges in and around the substrate binding 
site, not counting the RNR-box, provide sufficient counterions for a nucleotide of this size. 
The binding surface as outlined by NMR chemical shift data of the polypeptide backbone 
may slightly overestimate the true extent of the actual RNA-protein contact area, because 

30 the minute structural changes causing chemical shift changes may propagate beyond those 
residues that are in immediate contact with the ligand. However, the NMR-derived RNA 
interaction surface of RNaseP protein is comparable in size to that of other RNA binding 
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proteins with a-fold. For example, the Ul A spliceosomal protein recognises 6-7 
nucleotides (Oubridge et al., 1994). 

The location as well as the size of the NMR-derived interaction surface is in very good 
agreement with the substrate binding site that was postulated on the basis of chemical 
5 crosslinking data (Nirajanakumari et al.. 1 998). This may seem surprising, because the 

isolated RNaseP protein exposes a much larger surface area to potential ligands than in the 
holoenzyme, where parts of the protein are likely to be shielded by the vastly larger P RNA 
component. In particular, the cluster of positively charged residues in the RNR-box, which 
do not crosslink with the 5 -leader sequence in the holoenzyme and are therefore postulated 

10 to bind to P RNA, are only marginally disturbed in the model system, although the RNR- 
box is bordering on the substrate binding site (cmp. Fig 7 and 4c). Neither is there evidence 
for RNA interactions on the "backside" of the molecule, which is made up by helices a.7 
and 013 and the flexible loop between strands P3 and fJ 4 . The unexpectedly close 
correspondence between the substrate binding sites in a low molecular weight model and 

15 the holoenzyme suggests that the correct positioning of single-stranded leader sequences on 
the protein surface relies primarily on intrinsic structural features of the substrate and the 
protein rather than on steric guidance by the P RNA subunit. Furthermore, it is an 
indication for the existence of two different RNA binding sites on RNaseP protein to meet 
the very contrasting requirements of P RNA and substrate RNA. 

20 Compared to substrate RNA, binding of to P RNA, requires much higher affinity and 

specificity in order to locate the protein at the correct position within the framework of the 
large P RNA molecule. This is illustrated by the very low dissocciation constants for the 
RNaseP holoenzyme (AG assoc . = -55.6 kJ/mol in E.coli; Talbot and Altman, 1994). The 
arginine-rich motif of the RNR-box represents the most likely candidate site for high 

25 affinity binding to P RNA, similar to the well established example of the helical arginine- 
rich motif from HIV-1 Rev protein, which interacts with the distorted major groove of 
double- stranded RNA (Battiste et al., 1996; Peterson and Feigon, 1996: Ye et al., 1996). 
The specificity of an isolated arginine-rich peptide motif for double-stranded RNA is 
usually low and secondary interactions at other sites of the full length protein are necessary 

30 to differentiate between cognate and non-cognate RNA (Varani, 1997). In the case of 

RNaseP protein such sites have not yet been identified and the fact that RNaseP proteins 
and P RNA from different bacterial species cross-interact to reconstitute active holoenzyme 
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suggests that specificity for the correct binding site may depend on the three-dimensional 
structure of the P RNA molecule, rather than a particular nucleotide sequence. The 
arginine-rich motif of the RNR-box is expected to bind preferentially to double-stranded 
RNA, because binding to a conformational^ less restricted single-stranded ligand, such as 
5 the 5'-leader sequence, would incur a heavy entropic penalty in ordering the highly flexible 
arginine side chains to establish extensive intermolecular hydrogen bonding interactions. 
This hypothesis is consistent with the inability to observe interactions between the RNR- 
box and single-stranded homo-oligomers in the experimental model. 

Conversely, in order to bind to and efficiently catalyze the cleavage of a wide variety of 

0 different 5 -leader sequences, the substrate binding site of RNaseP protein has to carefully 
balance relatively non-specific long-range electrostatic interactions with more specific and 
conformationally more demanding short-range interactions, in particular hydrogen bonding, 
base-stacking with aromatic residues as well as general hydrophobic contacts. Basic 
residues in the substrate binding site are generally not highly conserved, suggesting that the 

5 overall charge, rather than the precise location of individual charges is important for 

electrostatic interactions with the phosphate backbone. The aromatic residues in the binding 
site, Phel5 and Tyrl9 are highly conserved and likely to engage in aromatic base stacking 
interactions. This is consistent with the experimentally observed preference for purine over 
pyrimidine bases. However, large parts of the interaction surface are devoid of 

0 phylogenetically conserved residues, in particular region A. This region has a large number 
of unoccupied main chain hydrogen bonding partners, in particular carbonyl oxygen atoms, 
at the solvent exposed edge of strand the P1-P2 loop and the C-terminus of helix a] 
(cmp. Fig. 4c, 7). In addition, the binding site contains a variety of side chain hydroxyl 
groups, i.e. Ser48, Tyrl9, Tyr33, Ser24, which are highlighted in Fig. 4c. Experimentally, a 

:5 particularly strong discrimination of RNA over DNA as well as an ability to bind 

uncharged nucleosides, is observed at residues Ser24 and Val25 at the edge of the p-sheet 
(Fig. 6a). Given the generally stronger hydrogen bonding capability of RNA compared to 
DNA, the strong discrimination of RNA over DNA suggests that hydrogen bonding may be 
the predominant mode of interaction. It is believed hypothesise that hydrogen bonding to 

>0 the polypeptide backbone is particularly important, because it does not depend on 
phylogenetic conservation of the amino acid side chain and the relatively high 
conformational constraint on the backbone carbonyl oxygens is entropically more 
favourable for interactions with flexible ligand molecules. 
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E. Cloning, expression and purification of S. aureus C5 protein 

Random sequencing of a S. aureus genomic library revealed a partial sequence 
homologous to the B. subrilis RNase P protein. Details describing cloning of the S. aureus 
rnpA gene are described elsewhere. Briefly, a two-step PCR method was devised using 

5 primers specific to the partial sequence in conjunction with random primers in a modified 
PCR amplification to obtain full length sequence. The entire S. aureus rnpA structural gene 
was recovered as a single DNA fragment by PCR and inserted into the pMalc2 protein 
expression vector (New England Biolabs). E.coli W31 10 pRI952 cells (Kane, 1995) were 
transformed by electroporation and grown on LB agar plates containing the appropriate 

0 selective antibiotic. The insert sequence was confirmed by dideoxy sequencing {{fmol ™ 
DNA Sequencing System, Promega). 

A modified M9 minimal medium with i5 N-ammonium chloride (lg/1, Cambridge 
Isotope Laboratories) and either 13 C-glucose (2g/l, fermentation grade, Martek) or 

5 unlabeled glucose as sole nitrogen and carbons sources were used for the preparation of 
uniformly ^N^lS^.^ouble labeled or single labeled protein. The protein was 
expressed as a fusion with maltose binding protein (MBP). The fusion protein was 
solubilized in 400mM sodium chloride and 1 % CHAPS and purified by cation exchange 
chromatography on a Biorad Biologic system (20 mM MES buffer pH 6.0, gradient 0-1M 

.0 magnesium chloride). The fusion protein was then cleaved with factor Xa at room 

temperature (Hematologic Technologies Inc.). After cleavage the free C5 protein, which is 
significantly more basic compared to the MBP-RNaseP protein fusion, is purified to 
homogeneity in a second cation exchange step. The identity of the product was confirmed 
by electron spray mass spectrometry {measured 13.653 kD vs. calculated 13.655 kD). The 

:5 single free cysteine residue in S aureus RNaseP protein (Cys34) was modified with 

iodoacetamide to prevent oxidative dimerization and thus improve the long term stability of 
the NMR-sample. This modification did not affect enzymatic activity on reconstitution of 
the RNaseP holoenzyme. 

F. Biophysical characterization 

>0 Because S. aureus RNaseP protein does not contain a tryptophan residue allowing for 

fluorescence measurements, equilibrium chemical denaturation in urea was monitored in 
the ID NMR spectrum by observation of the total intensity of the methyl signals. Spectra 
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were recorded on a Varian Unity300 spectrometer equipped with a 4mm nanoprobe. The 
data were fit to a 2-state model as described in Pace et al. (1989) to obtain estimates of 
[urea]i/ 2 . The self diffusion constant of C5 protein was determined by NMR with the 
method of Altieri et al. (1995) and the apparent molecular weight calculated using 
5 lysozyme, ubiquitin and GLP-1 peptide as molecular weight references. 

G. NMR-Spectroscopy 

NMR experiments were performed on Bruker DRX400 and DRX600, equipped with 
three channels and pulsed field gradients. Well established triple resonance methodology 
was applied to obtain assignments of the ] H, 15 N and 13 C backbone and CP resonances: 

10 ( 1 5 N, 1 H)-HSQC, 3D-constant time (ct)-HN(CO)CA, 3D-ct-HNCA and 3D-ct- 

CBCA(CO)NH (Bodenhausen & Ruben, 1980; Ikura et al., 1990; Grzesiek & Bax, 
1992a,b). Additional side chain 13 C and *H assignments were determined with HCCH- 
TOCSY (Kay et al., 1993) and HCCH-TROSY for aromatic residues (Pervushin et al., 
1998). Distance constraints were derived from 3D i5 N-correlated or 13 C-correlated 

15 ( 1 H, 1 H)-NOESY experiments with mixing times of 60 and 1 20ms (Fesik & Zuiderweg, 
1988). All experiments were acquired using a pulsed field gradient enhancement scheme 
(Kay, 1993; Schleucher et al., 1994). Data were processed with the XWINNMR software 
(Bruker) and analysed with the program XEASY (Bartels et al., 1995). 

NMR experiments on the free protein were performed at pH 6.0, 26 °C, 500 mM 
20 sodium chloride and a protein concentration of 1 .5mM (20mM sodium acetate, 20 mM 
sodium phosphate, 5% (v/v) 2 H20). Protein-ligand interaction studies were performed at 
pH 7.0, 20 °C,100 or 300 mM sodium chloride and a protein concentration of 0.8 mM. 

H. Structure determination 

The program package DYANA was used to translate NOE peak intensities from the 
25 3D-NOES Y spectra into distance constraints. Two sets of NOES Y spectra with different 
mixing times, 60 and 150 ms, were used to derive upper distance constraints, with a looser 
calibration function for the longer mixing time. For intraresidual and sequential NOEs and 
for the calculation of local angle constraints with the HABAS routine (Guntert et al., 1991) 
only 60 ms NOESY spectra were used. Hydrogen bond constraints were not introduced 
30 throughout the calculation. Stereospecific assignments for the diasterotopic methyl groups 
of valine and leucine were obtained using biosynthetically directed fractional l 3 C-labeling 
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(Neri et al., 1989). ¥,(|>- Angle constraints for polypeptide backbone were generated from 
C a secondary chemical shifts {Spera & Bax. 1991). A total of 100 DYANA conformers 
was calculated, using the default simulated annealing macro, and the 20 conformers with 
the lowest target function values were considered to represent the solution structure of 
5 RNaseP protein. No further energy minimization was performed on DYANA conformers. 
Structures were analyzed and displayed with the programs MOLMOL (Koradi et al., 1996) 
and PROCHECK (Laskowski et al., 1996). 

The sequence alignment was prepared with the program pileup (GCG 9.0; Genetics 
Computer Group Inc.) using default parameters for gapweight and gaplengthweight). The 
10 program GeneDoc was used in preparation of Fig. 5 (Nicholas and Nicholas, 1997; Version 
2.4, www.cris.com/~ketchup/genedoc.shtml). 

Table 4: Analysis of the 20 DYANA conformers representing the solution structure of S. 
aureus C5 protein. 



1 5 Number of restrictive distance constraints 

total / long range / medium / sequential / intrares. 1 598/528/346/422/302 

Dihedral angle constraints 



after HAB AS 



472 



DYANA Parameters a 



20 DYANA target function (A 2 ) 

Residual distance constraint violations 2 

sum of violations (A) 
maximum violation (A) 

Residual dihedral angle constraint violations 3 



7.8 ± 0.5 (6.8 .. 8.5) 



24.5 ± 1.6(20.6 ..27.6) 



0.55 ± 0.1 (0.4 .. 0.7) 



25 sum (deg.) 

maximum violation (deg.) 

RMSD relative to the mean coordinates (A) D 



28.6 ± 11.4 (11.4 ..51.5) 
6.3 ±2.9 (2.9 .. 16.0) 



all residues 



1.30(1.88) 



constrained residues (8-9, 13-35, 42-1 16) 
30 Ramachandran plot c 
favoured regions 



0.62(1.28) 



68.2 % 



additional allowed 



25.3 % 
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generously allowed 
disallowed 



a Values are given for the 20 conformers with the lowest DYANA target functions out of a 
5 total of 100 conformers calculated. The numbers given are mean values ± standard 
deviations. The minimum and maximum values among the 20 conformers are given in 
parentheses. 

b The first number refers to the polypeptide backbone. The number in parentheses refers to 
all non-hydrogen atoms. 
10 c The analysis was performed with the program PROCHECK (Laskowski et al., 1996). 
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WHAT IS CLAIMED IS: 

1. An antagonist that inhibits or an agonist that activates an activity of an 
RNasP polypetide selected from the group consisting of: a polypeptide having at least 70% 
identity to an amino acid sequence of Table 1, and a polypeptide comprising a sequence set 
forth in Table 1, wherein said activity is selected from the group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

2. A method for the treatment of an individual having need to inhibit or activate 
RNaseP polypeptide or holoenzyme comprising the steps of: administering to the individual a 
antibacterially effective amount of an antagonist that inhibits or an agonist that activates an 
activity of an RNaseP polypeptide or holoenzyme, wherein said activity is selected from the 
group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

3. A method for the treatment of an individual infected with a bacteria 
comprising the steps of administering to the individual a antibacterially effective amount of an 
antagonist that inhibits or an agonist that activates an activity of an Rnase P polypeptide or 
holoenzyme, wherein said activity is selected from the group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

4. The method of claim 3 wherein said bacteria is selected from the group 
consisting of a member of the genus Staphylococcus, Staphylococcus aureus, a member of the 
genus Streptococcus, and Streptococcus pneumoniae. 
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5. A method for the treatment of an individual having need to inhibit or 
activate RNaseP polypeptide or holoenzyme comprising the steps of administering to the 
individual a antibacterially effective amount of an antagonist that inhibits or an agoinist that 
activates an activity of RNaseP polypeptide or holoenzyme selected from the group 
consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

6. A method for the treatment of an individual infected with a bacteria 
comprising the steps of administering to the individual a antibacterially effective amount of an 
antagonist that inhibits or an agonist that activates that activates an activity of RNaseP 
selected from the group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

7. The method of claim 6 wherein said bacteria is selected from the group 
consisting of: a member of the genus Staphylococcus, Staphylococcus aureus, a member of 
the genus Streptococcus, and Streptococcus pneumoniae. 

8. A method for the treatment of an individual infected by Streptococcus 
pneumoniae comprising the steps of administering to the individual a antibacterially effective 
amount of an antagonist that inhibits or anagonist that activates an activity of Streptococcus 
pneumoniae RNAseP polypeptide or holoenzyme selected from the group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

9. A method for inhibiting an activity of RNAseP polypeptide or holoenzyme 
comprising the steps of contacting a composition comprising said polypeptide with an 
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effective amount of an antagonist that inhibits an activbity of RNAseP. wherein said activity 
is selected from the group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

10. A method for inhibiting an activity of RnaseP polypeptide or holoenzyme, 
wherein said activity is selected from the group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

11. The method of claim 10 wherein said bacteria is selected from the group 
consisting of: a member of the genus Staphylococcus, Staphylococcus aureus, a member of 
the genus Streptococcus, and Streptococcus pneumoniae. 

12. A method for inhibiting a growth of bacteria comprising the steps of 
contacting a composition comprising bacteria with an antibacterially effective amount of an 
antagonist that inhibits an activity of RnaseP polypeptide or holoenzyme, wherein said 
activity is selected from the group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

13. The method of claim 12 wherein said bacteria is selected from the group 
consisting of: a member of the genus Staphylococcus, Staphylococcus aureus, a member of 
the genus Streptococcus, and Streptococcus pneumoniae. 

14. A method for inhibiting a RNaseP polypeptide or holoenzyme comprising the 
steps of contacting a composition comprising bacteria with an antibacterially effective amount 
of an antagonist that inhibits an activity of RNaseP, wherein said activity is selected from the 
group consisting of: 

binding of said polypeptide to pre-tRNA; 
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binding of said holoenzyme to pre-tRNA; 
binding of said polypeptide to pre-tRNA and RNaseP RNA; 
cleavage of pre-tRNA bound to said polypeptide; and 
cleavage of pre-tRNA bound to said holoenzyme. 

15. The method of claim 14 wherein said bacteria is selected from the group 
consisting of: a member of the genus Staphylococcus, Staphylococcus aureus, a member of 
the genus Streptococcus, and Streptococcus pneumoniae. 

16. A method for inhibiting or activating an activity of RNaseP polypeptide or 
holoenzyme comprising the steps of contacting a composition comprising RNaseP 
polypeptide or holoenzyme with an effective amount of an antagonist that inhibits or an 
agonist that activates an activbity of RNAseP polypeptide or holoenzyme, wherein said 
activity is selected from the group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

17. A method for inhibiting a RNaseP polypeptide or holoenzyme comprising the 
steps of contacting a composition comprising bacteria with an antibacterially effective amount 
of an antagonist that inhibits an activity of RNaseP, wherein said activity is selected from the 
group consisting of: 

binding of said polypeptide to pre-tRNA; 

binding of said holoenzyme to pre-tRNA; 

binding of said polypeptide to pre-tRNA and RNaseP RNA; 

cleavage of pre-tRNA bound to said polypeptide; and 

cleavage of pre-tRNA bound to said holoenzyme. 

18. A method for the treatment of an individual infected with a bacteria 
comprising the steps of administering to the individual a antibacterially effective amount of an 
antagonist that inhibits or an agonist that activates that activates an activity of RNaseP 
selected from the group consisting of: 

binding of said polypeptide to pre-tRNA; 
binding of said holoenzyme to pre-tRNA; 
binding of said polypeptide to pre-tRNA and RNaseP RNA; 
cleavage of pre-tRNA bound to said polypeptide; and 
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cleavage of pre-tRNA bound to said holoenzyme. 

19. The method of claim 18 wherein said bacteria is selected from the group 
consisting of: a member of the genus Staphylococcus, Staphylococcus aureus, a member of 
the genus Streptococcus, and Streptococcus pneumoniae. 



-82- 



WO 00/70025 



PCT/US00/13946 



SEQUENCE LISTING 

<110> SMITHKLINE BEECHAM CORPORATION 
SMITHKLINE BEECHAM pic 

<120> METHODS OF USING RNASE P REACTION 
MECHANISMS OF ACTION 

<130> GM50055 

<140> TO BE ASSIGNED 
<141> 2000-05-19 

<150> US 60/134, 802 
<151> 1999-05-19 

<150> US 60/140,044 
<151> 1999-06-18 

<160> 14 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 354 
<212> DNA 

<213> Staphylococcus aureus 
<400> 1 

atgttattgg aaaaagctta ccgaattaaa aagaatgcag attttcagag aatatataaa 60 
aaaggtcatt ctgtagccaa cagacaattt gttgtataca cttgtaataa taaagaaata 120 
gaccattttc gcttaggtat tagtgtttct aaaaaactag gtaatgcagt gttaagaaac 180 
aagattaaaa gagcaatacg tgaaaatttc aaagtacata agtcgcatat attggccaaa 240 
gatattattg taatagcaag acagccagct aaagatatga cgactttaca aatacagaat 300 
agtcttgagc acgtacttaa aattgccaaa gtttttaata aaaagattaa gtaa 354 

<210> 2 
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<211> 117 
<212> PRT 

<213> Staphylococcus aureus 



<400> 2 

Met Leu Leu Glu Lys Val Tyr Arg He Lys Lys Asn Ala Asp Phe Gly 

15 10 15 

Arg He Tyr Lys Lys Gly His Ser Val Ala Asn Arg Gin Phe Val Val 

20 25 30 

Tyr Thr Cys Asn Asn Lys Glu He Asp His Phe Arg Leu Gly He Ser 

35 40 45 

Val Ser Lys Lys Leu Gly Asn Ala Val Leu Arg Asn Lys He Lys Arg 

50 55 60 

Ala He Arg Glu Asn Phe Lys Val His Lys Ser His He Leu Ala Lys 
65 70 75 80 

Asp He He Val He Ala Arg Gin Pro Ala Lys Asp Met Thr Thr Leu 

85 90 95 

Gin He Gin Asn Ser Leu Glu His Val Leu Lys He Ala Lys Val Phe 

100 105 HO 

Asn Lys Lys He Lys 
115 



<210> 3 
<211> 401 
<212> DNA 

<213> Staphylococcus aureus 



<400> 3 

gttctgatat tttgggtaat cgctatatta tatagaggaa agtccatgct cacacagtct 
gagatgattg tagtgttcgt gcttgatgaa acaataaatc aaggcattaa tttgacggca 120 
atgaaatatc ctaagtcttt cgatatggat agagtaattt gaaagtgcca cagtgacgta 
gcttttatag aaatataaaa ggtggaacgc ggtaaacccc tcgagtgagc aatccaaatt 
tggtaggagc acttgtttaa cggaattcaa cgtataaacg agacacactt cgcgaaatga 
agtggtgtac gacagatggt tatcacctga gtaccagtgt gactagtgca cgtgatgagt 
acgatggaac agaacatggc ttatagaaat atcactacta g 



180 
240 
300 
360 
401 



<210> 4 
<211> 64 
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<212> DNA 

<213> Staphylococcus aureus 
<220> 

<221> unsure 

<222> (30) (35) (39) (40) 

<400> 4 

agactgaccc tagggttaag acttggcccn agggntatnn taacttactt taaaatgttt 
tcac 

<210> 5 
<211> 24 
<212> DNA 

<213> Streptococcus aureus 

<400> 5 
cgcgaagtgt gtctcgttta tacg 

<210> 6 
<211> 17 
<212> DNA 

<213> Streptococcus aureus 

<400> 6 
gaggaaagtc catgctc 

<210> 7 
<211> 24 
<212> DNA 

<213> Staphylococcus aureus 

<400> 7 
tttgttctaa ttttctcgtt atgc 

<210> 8 
<211> 87 
<212> DNA 

<213> Staphylococcous aureus 

3/5 



WO 00/70025 



PCT/US00/13946 



<400> 8 

ccatagggta gggaatgaaa acattaaccc ctcaacgcat cccgaagtct tacctcagac 60 
aaacgtttat cccatccctt acttttg 87 

<210> 9 
<211> 213 
<212> DNA 

<213> Staphylococcus aureus 
<400> 9 

gttttcattc cctaccctat ccttacttaa tctttttatt aaaaagaatg cagattttca 60 
gagaatatat aaaaaaggtc attctgtagc caacagacaa tttgttgtat acacttgtaa 120 
taataaagaa atagaccatt ttcgcttagg tattagtgtt tctaaaaaac taggtaatgc 180 
agtgttaaga acaagattaa aagagcaata cgt 213 

<210> 10 
<211> 215 
<212> DNA 

<213> Staphylococcus aureus 
<400> 10 

acgataaata acagtatgtt attggaaaaa gtctaccgaa ttaaaaagaa tgcagatttt 60 
cagagaatat ataaaaaagg tcattctgta gccaacagac aatttgttgt atacacttgt 120 
aataataaag aaatagacca ttttcgctta ggtattagtg tttctaaaaa actaggtaat 180 
gcagtgttaa gaacaagatt aaaagagcaa tacgt 215 

<210> 11 
<211> 170 
<212> DNA 

<213> Staphylococcus aureus 
<400> 11 

gaaaatttca aagtacataa gtcgcatata ttggccaaag attgtaatag caagacagcc 60 
agctaaagat atgacgactt tacaaataca gaatagtctt gagcacgtac ttaaaattgc 120 
caaagttttt aataaaaaga ttaagtaagg atagggtagg gaatgaaaac 170 

<210> 12 



4/5 



WO 00/70025 



PCT/US00/13946 



<211> 93 
<212> DNA 

<213> Staphylococcus aureus 
<400> 12 

gggcgaauuc gccucggcua cguagcucag uugguuagag cacaucacuc auaaugaugg 60 
ggucacaggu ucgaaucccg ucguagccac cag 93 

<210> 13 
<211> 17 
<212> DNA 

<213> Staphylococcus aureus 
<400> 13 

tgatatttct ggtaacc 17 

<210> 14 
<211> 20 
<212> DNA 

<213> Staphylococcus aureus 
<400> 14 

guucugauau uuuggguaau 20 
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1/9 



^i— 125mM NaCI 
250 mM NaCI 



1 2 

[Urea]; mol/l 
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Figure 3 
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Figure 4a 
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